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(57) Abstract: A nucleic acid construct is provided. The nucleic acid construct includes (a) an expression cassette including: (i) a 
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NUCLEIC ACIDS AND METHODS FOR DETECTING VIRAL 
INFECTION, UNCOVERING ANTI-VIRAL DRUG CANDIDATES 
AND DETERMINING DRUG RESISTANCE OF VIRAL ISOLATES 

5 

FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to nucleic acid constructs and methods 
of utilizing same for detecting infection of an RNA virus, for uncovering 
anti-viral drug candidates and for detennining drug resistance of isolates of 

10 an RNA virus. More particularly, the present invention relates to a nucleic 
acid construct which transcribes a minus strand RNA sequence encoding a 
reporter polypeptide and including 5' and 3' sequences of an RNA virus. 
When transcribed in a cell infected with an RNA virus capable of 
replicating the minus strand RNA sequence, a plus strand of this RNA 

15 sequence is formed and translated by the host cell into an active reporter 
polypeptide. 

Viral diseases are some of the major scourges of mankind and 
include such virulent disorders as smallpox, yellow fever, rabies, 
poliomyelitis and AIDS. In addition, viruses carrying oncogenes are 

20 responsible for a number of human tumors and cancers. 

It is a remarkable and proven fact that some virus infections occur 
without overt symptoms, while others can cause more than one clinical 
manifestation involving more than one organ system of the body. This 
lack of a defining clinical manifestation in some infections, presents a 

25 major hurdle to an accurate and timely diagnosis of infections, which in 
some cases is crucial for the prevention of disease and death. 

Several diagnostic procedures have been developed in efforts to 
improve the detection and diagnosis of viral infections. These procedures 
involve the detection of viral components in cells of infected individuals or 

30 the detection of blood components generated as a response to the presence 
of a viral infection. Although such methods provide acceptable accuracy in 
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detecting some viral infections, they are oftentimes expensive and time 
consuming to cany out 

Although accurate and timely diagnosis of some viral infections 
provides clinicians with better chances of combating viral infection, the 
5 lack of suitable anti-viral drugs limits the possibilities of treatment for such 
viral infections 

As such, for the past decades, universities and pharmaceutical 
companies have invested considerable resources in efforts to uncover 
potential anti-viral drug candidates and/or to determine the anti-viral drug 

10 resistance of some viruses. 

Present day anti-viral drug screening methods rely on detecting 
interactions between viral components and molecules having potential 
anti-viral activity. For example, the identification of inhibitors of virally 
encoded proteases ("protease inhibitors") relies on the in-vitro screening of 

15 purified viral protease with chemical compounds in the presence of 
synthetic peptide substrates. Initial in-vitro screening is usually followed by 
a bioassay designed for determining whether a potential protease inhibitor 
or its derivatives function in virally infected cells prior to additional testing 
conducted in more complex biological systems. 

20 Screening for drug resistance of certain virus isolates is typically 

effected by phenotypic testing (plaque reduction assay). This is a labor 
intensive, time consuming and expensive technique that oftentimes does not 
correlate well to the clinical response to drug therapy in individual patients. 
Nonetheless, because of its derivation from testing for sensitivity to 

25 antibacterial agents, this technique is often considered to be the "gold 
standard". 

Prior art drug and drug resistance screening methods, such as the 
methods described above, are further limited in that such methods are not 
readily utilizable in screening for molecules possessing anti-viral activities 
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against, nor can they be utilized to determine the drug resistance of, 
RNA viruses. 

A large portion of the viruses responsible for human diseases are 
RNA viruses. Since the RNA genome of such viruses is replicated via an 
RNA intermediate, recombinant manipulation thereof for the purposes of 
constructing cell, or cell free assays is oftentimes a difficult task. In 
addition, the high heterogeneity of RNA viral genomes further complicates 
recombinant manipulation and also limits the accuracy of prior.art cell free 
drug and drug resistance screenings. 

One example of a disease causing RNA virus is the Hepatitis C virus 
(HCV) which is a member of the Flayiviridae family, and the major cause 
of chronic liver disease worldwide (1, 2). HCV is an enveloped virus with 
a single-stranded, positive sense, RNA genome that encodes a single open 
reading frame (ORF) of about 3010 amino acids (aa) which is 
co-translationally and post-translationally cleaved to give rise to at least 10 
polypeptides (3). Located at its N-terminal end are three structural proteins, 
followed by at least seven non-structural (NS) proteins (1). Combined 
action of host-derived signal peptidases) and the virus-encoded proteases 
are involved in the processing of this polyprotein (4-8). 

Similar to other RNA viruses, the genome of HCV is highly 
heterogeneous, and several genotypes and subtypes have been described 
(12, 13). Numerous studies have successfully demonstrated partial 
replication of the virus in in-vitro culture systems using human T-cells, 
B-cells (9, 10), human hepatocytes (11, 12) or chimpanzee hepatocytes (13, 
14). However, these systems suffer from low viral replication efficiency 
and limited passage cycles. More recently, high level replication of 
subgenomic HCV RNA was established in a human hepatoma cell line that 
would enable long-term production of viral RNA and proteins (14). 
Unfortunately, the complete life cycle of virus does not take place in this 
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system nor are infectable virions produced as transfection with the 
full length genome failed to produce any viable cell clones (14). 

Replication of HCV in vivo involves the replication of its single 
positive-stranded RNA through negative (anti-sense) strand intermediates 
via the NS5B polymerase (15-17). The negative strand RNA formed then 
serves as a template for the synthesis of more positive RNA strands which 
are either used as templates for translation of viral proteins or packaged for 
production of viral particles. Binding and initiation of reverse strand 
synthesis by NS5B is dependent on stem-loop structures present in the 3 ' of 
the viral genome (17, 18). Based on this knowledge the inventors of the 
present invention decided to create a reporter system using constructs 
encoding anti-sense luciferase gene flanked by HCV 5' and 3' NCR. 

While reducing the present invention to practice, a cDNA clone 
encoding a complete HCV genome was generated by the present inventors. 
Sequences derived from this cDNA clone were incorporated in novel 
chimeric HCV-luciferase expression constructs which can be used, 
according to the teachings of the present invention, in accurate and rapid 
cell based assays for detecting HCV infection, screening molecules for 
potential anti-viral activities and determining drug resistance of HCV 
isolates. 

SUMMARY OF THE INVENTION 

According to one aspect of the present invention mere is provided a 
nucleic acid construct comprising: (a) an expression cassette including: (i) a 
first polynucleotide region including a 5' NCR sequence of an RNA virus 
and at least an N-terminal portion of a coding sequence of the RNA virus; 
(ii) a second polynucleotide region including a 3' UTR sequence of the 
RNA virus and at least a C-terminal portion of a coding sequence of the 
virus; and (Hi) a third polynucleotide region encoding a reporter molecule, 
the third polynucleotide region being flanked by the first and the second 
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polynucleotide regions; and (b) a promoter sequence . being 
operatively linked to the expression cassette in a manner so as to enable a 
transcription of a minus strand RNA molecule from the expression cassette. 
According to another aspect of the present invention there is 
5 provided a genetically transformed cell comprising a nucleic acid construct 
including: (a) an expression cassette including: (i) a first polynucleotide 
region including a 5' NCR sequence of an RNA virus and at least an 
N-terminal portion of a coding sequence of the RNA virus; (ii) a second 
polynucleotide region including a 3' UTR sequence of the RNA virus and 

10 at least a C-teiminal portion of a coding sequence of the virus; and (iii) a 
third polynucleotide region encoding a reporter molecule, the third 
polynucleotide region being flanked by the first and the second 
polynucleotide regions; and (b) a promoter sequence being operatively 
linked to the expression cassette in a manner so as to enable a transcription 

15 of a minus strand RNA molecule from the expression cassette. 

According to former features in preferred embodiments of the 
invention described below, the genetically transformed cell former 
comprising an additional nucleic acid construct for expressing at least an 
RNA dependent RNA polymerase of a virus, whereas the first and the 

20 second polynucleotide regions being selected such that foe RNA dependent 
RNA polymerase is capable of replicating foe minus strand RNA molecule 
. into plus strand RNA. 

According to still farther features in foe described preferred 
embodiments at least a portion of foe first polynucleotide region is at least 

25 50 % identical to a sequence encompassed by nucleotides 1-374 of SEQ ID 
NO:33. 

According to still further features in foe described preferred 
embodiments at least a portion of foe second polynucleotide region is at 
least 50 % identical to a sequence encompassed by nucleotides 9158-9609 
30 ofSEQE)NO:33. 
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According to still further features in the described preferred 
embodiments the first polynucleotide region further includes a 5 r UTR 
sequence of the RNA virus. 

According to still further features in the described preferred 
5 embodiments the first polynucleotide region includes an IRES sequence. 

According to still further features in the described preferred 
embodiments the RNA virus is selected from the group consisting of a 
positive strand RNA virus and a negative strand RNA virus. 

According to still further features in the described preferred 
10 embodiments the RNA virus is selected from the group consisting of a virus 
of the picornavirus family, a virus of the togavirus family, a virus of die 
orthomyxovirus family, a virus of the paramyxovirus family, a virus of the 
coronavirus family, a virus of the calicivirus family, a virus of the 
arenavirus family, a virus of the rhabdovirus family and a virus of the 
15 bunyavirus family. 

According to still further features in the described preferred 
embodiments the RNA virus is Hepatitis C. 

According to still further features in the described preferred 
embodiments the first and the second polynucleotide regions are selected 
20 such that the minus strand RNA molecule transcribable from the expression 
cassette is replicatable by an RNA dependent RNA polymerase of the virus 
into a plus strand RNA molecule. 

According to still further features in the described preferred 
embodiments die promoter is functional in a eukaryotic cell. 
25 According to still further features in the described preferred 

embodiments the eukaryotic cell is selected from the group consisting of an 
insect cell, a yeast cell and a mammalian cell. 

According to still further features in the described preferred 
embodiments die reporter molecule is a polypeptide selected from die group 
30 consisting of an enzyme, a fluorophore, a substrate and a ligand. 
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According to yet another aspect of the present invention there is 
provided a method of detecting a presence of an RNA virus in a cell, the 
method comprising the steps of: (a) incubating a nucleic acid construct with 
an extract of the cell under conditions suitable for transcription and 
5 translation of the nucleic acid construct, the nucleic acid construct 
including: (i) an expression cassette having: (one) a first polynucleotide 
region including a 5' NCR sequence of an RNA virus and at least an 
N-terminal portion of a coding sequence of the RNA virus; (two) a second 
polynucleotide region including a 3* UTR sequence of the RNA virus and 

10 at least a C-terminal portion of a coding sequence of the virus; and 

(three) a third polynucleotide region encoding a reporter molecule, the 
third polynucleotide region being flanked by the first and the second 
polynucleotide regions; and (ii) a promoter sequence being operatively 
linked to the expression cassette in a manner so as to direct the transcription 

15 of a minus strand RNA molecule from the expression cassette when the 
nucleic acid construct is incubated with the extract, the first and the second 
polynucleotide regions being selected such that the minus strand RNA 
molecule transcribed is replicatable by the polymerase of the RNA virus 
into a plus strand RNA molecule; and (b) quantifying a level of the reporter 

20 molecule to thereby determine the presence of the virus in the cell. 

According to still further features in the described preferred 
embodiments the reporter molecule is a polypeptide translated from the plus 
strand RNA molecule. 

According to still further features in the described preferred 

25 embodiments the method described above further comprising the step of 
comparing the level of the reporter molecule to that obtained from cells 
free of the virus. 

According to a further aspect of the present invention there is 
provided a method of screening for anti-viral drugs, the method comprising 
30 the steps of: (a) co-incubating a nucleic acid construct, a polynucleotide 
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encoding at least a polymerase of an RNA virus and a potential 
anti-viral molecule under conditions suitable for transcription and 
translation of the nucleic acid construct and the polynucleotide encoding at 
least the polymerase, the nucleic acid construct including: (i) an expression 
cassette having: (one) a first polynucleotide region including a 5 r NCR 
sequence of an RNA virus and at least an N-terminal portion of a coding 
sequence of the RNA virus; (two) a second polynucleotide region including 
a 3' UTR sequence of the RNA virus and at least a C-terminal portion of a 
coding sequence of the virus; and 

(three) a third polynucleotide region encoding a reporter molecule, the 
third polynucleotide region being flanked by the first and the second 
polynucleotide regions; and (ii) a promoter sequence being operatively 
linked to the expression cassette in a manner so as to direct the transcription 
of a minus strand RNA molecule from the expression cassette when the 
nucleic acid construct is incubated with the polynucleotide encoding the 
polymerase of the RNA virus under the conditions suitable for transcription 
and translation, the first and the second polynucleotide regions being 
selected such that the minus strand RNA molecule transcribed is 
replicatable by the polymerase of the RNA virus into a plus strand RNA 
molecule; and (b) quantifying a level of the reporter molecule to thereby 
determine the anti-viral activity of the potential anti-viral molecule. 

According to still further features in the described preferred 
embodiments the reporter molecule is a polypeptide translated from the plus 
strand RNA molecule. 

According to still further features in the described preferred 
embodiments the method described above further comprising the step of 
comparing the level of the reporter molecule to that obtained from cells 
free of the virus. 

According to still further features in the described preferred 
embodiments the potential anti-viral molecule is selected from the group 
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consisting of a nucleoside or nucleotide analogue and an 
immune-modulatory molecule. 

According to still further features in the described preferred 
embodiments step (a) is effected by introducing the nucleic acid construct, 
the polynucleotide encoding at least the polymerase of the RNA virus and 
the potential anti-viral molecule into a cell. 

According to still further features in the described preferred 
embodiments step (a) is effected by introducing the nucleic acid construct 
and the potential anti-viral molecule into a cell infected with the RNA virus. 

According to yet a further aspect of the present invention there is 
provided a method of determining drug resistance of an RNA virus, the 
method comprising the steps of: (a) co-incubating a nucleic acid construct, 
a polynucleotide encoding at least a polymerase of the RNA virus and an 
anti-viral drug molecule under conditions suitable for transcription and 
translation of the nucleic acid construct and the polynucleotide encoding at 
least the polymerase, the nucleic acid construct including: (i) an expression 
cassette having: (one) a first polynucleotide region including a 5' NCR 
sequence of an RNA virus and at least an N-terminal portion of a coding 
sequence of the RNA virus; (two) a second polynucleotide region including 
a 3' UTR sequence of the RNA virus and at least a C-terminal portion of a 
coding sequence of the virus; and (three) a third polynucleotide region 
encoding a reporter molecule, the third polynucleotide region being flanked 
by the first and the second polynucleotide regions; and (ii) a promoter 
sequence being operatively linked to the expression cassette in a manner so 
as to direct the transcription of a minus strand RNA molecule from the 
expression cassette when the nucleic acid construct is incubated with the 
polynucleotide encoding at least the polymerase of the RNA virus under the 
conditions suitable for transcription and translation, the first and the second 
polynucleotide regions being selected such that the minus strand RNA 
molecule transcribed is replicatable by the polymerase of the RNA virus 
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into a plus strand RNA molecule; and (b) quantifying a level of the 
reporter molecule to thereby determine me resistance of the RNA virus to 
the anti-viral drug. 

According to still further features in the described preferred 
5 embodiments the method described above further comprising the step of 
comparing the level of the reporter molecule to mat obtained from cells 
free of the anti-viral drug. 

According to still further features in the described preferred 
embodiments the reporter molecule is a polypeptide translated from the plus 
10 strand RNA molecule. 

According to still further features in the described preferred 
embodiments the anti-viral drug is selected from the group consisting of a 
nucleoside or nucleotide analogue and an immune-modulatory molecule. 

According to still further features in the described preferred 
15 embodiments step (a) is effected by introducing the nucleic acid construct, 
the polynucleotide encoding at least the polymerase of the RNA virus and 
the anti-viral drug into a cell. 

According to still further features in the described preferred 
embodiments step (a) is effected by introducing the nucleic acid construct 
20 and the anti-viral drug into a cell infected with the RNA virus. 

The present invention successfully addresses the shortcomings of the 
presently known configurations by providing nucleic acid constructs and 
methods of utilizing same for detecting the presence of an RNA virus in a 
25 cell or a cell extract, for uncovering novel anti-viral drugs and for 
determining the resistance of RNA virus isolates to anti-viral drugs. 



30 



BRIEF DESCRIPTION OF THE DRA WINfiS 

The invention is herein described, by way of example only, with 
reference to the accompanying drawings. With specific reference now to 
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the drawings in detail, it is stressed that the particulars shown are by 
way of example and for purposes of illustrative discussion of the preferred 
embodiments of the present invention only, and are presented in the cause 
of providing what is believed to be the most useful and readily understood 
5 description of the principles and conceptual aspects of the invention. In this 
regard, no attempt is made to show structural details of the invention in 
more detail than is necessary for a fundamental understanding of the 
invention, the description taken with the drawings making apparent to those 
skilled in the art how the several forms of the invention may be embodied 
io in practice. 

In the drawings: 

FIG. 1A is a schematic representation of the overlapping HCV 
cDNA clones of HCV-S1 utilized in constructing the HCV genome. The 
positions of the first and last nucleotides and amino acids of the individual 
15 HCV proteins as well as the first and last nucleotide of the HCV 5* UTR 
and 3' UTR are indicated. Clones A-M represent the overlapping cDNA 
clones of HCV-S1 obtained from RT-PCR. The first and last nucleotide of 
each clone is indicated. 

FIG. IB illustrates the step employed for constructing the sense and 
20 antisense chimeric vectors of the present invention. 

FIGs. 2A-C illustrate the protein products of in vitro translation 
experiments of HCV constructs separated on SDS-PAGE. Figure 2A - 
translation of the entire non-structural HCV polyprotein from 
pcDNA3(NSP). Figure 2B - translation of the entire structural HCV 
25 polyprotein from pcDNA(SP). Figure 2C - translation of the foil length 
HCV genome from pcDNA3(Sl). CPMM represents incubation with 
canine pancreatic microsomal membranes. Arrows indicate positions of 
autolytically cleaved products upon prolonged incubation. Molecular 
weight marker sizes (in kDa) are indicated on the left 
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FIGs. 3A-G illustrate western analysis of 293T cells transiently 
transfected with pXJ41(Sl). Cells were harvested two days 
post-transfection and lysate proteins were separated on SDS-PAGE gels and 
transferred onto nitrocellulose membranes. The blotted proteins were 
probed with anti-E2 (Figure 3A), anti-NS3 (Figure 3B) and anti-NS5A 
(Figure 3C) monoclonal antibodies. The detection of core (Figures 3E-G) 
and NS5B (Figures 3D-E) proteins, was effected using different sera from 
HCV infected patient at a dilution of 1:100 (Figures 3D-G). The 
immunoblots of Figures 3D-E represent sera taken from the same patient 
from which the HCV-S1 was cloned*. Molecular weight marker sizes (in 
kDa) are indicated on the left. 

DESCFJPTION OF THE PREFERRED EMBODIMENTS 

The present invention is of nucleic acid constructs and methods 
utilizing same which can be utilized for detecting infection of an RNA 
virus, for uncovering anti-viral drug candidates and for determining drug 
resistance of isolates of an RNA virus. Specifically, the present invention is 
of a nucleic acid construct which transcribes a minus strand RNA sequence 
encoding a reporter polypeptide and including 5' and 3' sequences of an 
RNA virus. When transcribed in a cell infected with an RNA virus capable 
of replicating the minus strand RNA sequence, a plus strand of this RNA 
sequence is formed and translated by the host cell into an active reporter 
polypeptide. 

The principles and operation of the present invention may be better 
understood with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, 
it is to be understood that the invention is not limited in its application to 
the details of construction and the arrangement of the components set forth 
in the following description or illustrated in the drawings. The invention is 
capable of other embodiments or of being practiced or carried out in various 
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ways. Also, it is to be understood that the phraseology and terminology 
employed herein is for the purpose of description and should not be 
regarded as limiting. 

The molecular studies of the pathogenesis of HCV and the 
development of anti-viral drugs have been hampered in part by the lack of a 
robust, cell-based assay to monitor viral replication. The currently available 
cell-based systems are limited by the low viral replication efficiency and 
limited passage cycles. Although high levels of replication of subgenomic 
HCV RNA was established in a human hepatoma cell line that would 
enable long-term production of viral RNA and proteins, this does not truly 
measure viral replication. The complete life cycle of HCV does not occur in 
this system, nor are infectable virions produced. Moreover, the authors 
failed to generate any viable cell clones when they carried out transfections 
with the full length genome (14). 

Replication of the HCV genome in vivo is dependent in part on the 
proteolytic activity of host signal peptidase(s) for cleavage of its structural 
genes and on its NS3 protein, which systematically cleaves the viral NS 
polyprotein to release the individual active subunits (7). Of these, the viral 
RNA dependent RNA polymerase, NS5B, plays a vital role in replication 
through synthesis of both positive and negative viral RNA strands (15). 
Due to the low replication efficiency of HCV, nested RT-PCR for 
amplifying minus-strand RNA is employed to determine viral replication in 
vivo. This method is both laborious and easily prone to false positive 
errors. Although its sensitivity and reliability has been improved with the 
use of tagged primers and Tth polymerase (13), it still remains expensive 
and time-consuming. 

As is further described in the Examples section which follows, to 
generate a reliable and simple reporter assay system Which can be utilized 
to detect hepatitis C virus (HCV) replication in vivo, and to uncover novel 
anti-viral drugs as well as to screen for drug resistance in viral isolates, the 



WO 02/08447 PCT/IL01/00669 

14 

present inventors undertook the laborious task of generating a 
replication-competent foil length HCV genome. 

Sequences derived from this clone were then utilized to generate 
reporter expression constructs which produce a reporter signal in the 
5 presence of infecting virus particles. 

Thus, according to one aspect of the present invention there is 
provided a nucleic acid construct The nucleic acid construct includes an 
expression cassette having a first polynucleotide region including a 5' NCR 
sequence of an RNA virus and at least an N-terminal portion of a coding 

10 sequence of the RNA virus, such as for example the N-terminal portion of 
the core sequence, and a second polynucleotide region including a 3' UTR 
sequence of the RNA virus and at least a C-terminal portion of a coding 
sequence of the virus, such as for example a C-terminal portion of the viral 
polymerase sequence. The expression cassette also includes a third 

15 polynucleotide region which encodes a reporter polypeptide such as for 
example, an enzyme, a substrate, a ligand or receptor or a fluorophore. 

According to the present invention, the reporter molecule encoding 
region is flanked by the first and the second polynucleotide regions and is in 
transcriptional linkage therewith. 

20 The nucleic acid construct according to this aspect of the present 

invention, also includes a promoter sequence which serves to direct 
transcription of the expression cassette sequence in eukaryotic cells such as 
for example, mammalian cells, yeast cells or insect cells. 

The promoter sequence is oriented with respect to the expression 

25 cassette sequence, such that transcription therefrom generates a minus 
strand RNA molecule. 

As used herein the phrase "minus (or negative) strand RNA" refers 
to the complementary RNA strand of the "plus (or positive) strand RNA" 
which is the strand typically translated by the ribosomes into a polypeptide 

30 sequence. 
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According to a preferred embodiment of the present 
invention, at least a portion of the first polynucleotide region is at least 50 
%, at least 60 %, at least 70 % at least 80 %, at least 90 to 95 % identical to 
a sequence encompassed by nucleotides 1-374 of SEQ ID NO:33. 
5 According to another preferred embodiment of the present invention, 

at least a portion of the second polynucleotide region is at least 50 %, at 
least 60 %, at least 70 % at least 80 %, at least 90 to 95 % identical to a 
sequence encompassed by nucleotides 9158-9609 of SEQ ID NO:33. 

Since the nucleic acid construct of the present invention transcribes a 

10 minus strand RNA molecule in cells, such a construct cannot generate an 
active reporter molecule in cells transformed with this construct. However, 
in the presence of a viral polymerase, such as the RNA dependent RNA 
polymerase encoded by RNA viruses (hereinafter RNA polymerase), such 
as the case when the transformed cell is infected with a virus or expresses 

is the viral polymerase, replication of the minus strand RNA takes place and a 
plus strand RNA molecule is formed. This molecule can then be translated 
by the host cell ribosome into an active reporter molecule. It will be 
appreciated that this is true only in cases where the viral RNA polymerase 
binds and initiates replication from the viral sequences included within the 

20 transcribed minus strand RNA. In most cases, the viral sequences utilized 
in the expression cassette of the nucleic acid construct will be derived from 
the virus of interest, although in some cases, RNA polymerases of one virus 
can replicate RNA which includes 5* and 3' sequences from another virus. 
Since the sequences regulating RNA replication in RNA viruses 

25 reside in the 5' and 3' NCRs and/or UTRs, such sequences alone are often 
sufficient in promoting RNA replication of the minus strand RNA 
transcribed from the nucleic acid construct of the present invention. 
However, not withstanding from the above, in some RNA viruses, coding 
region sequences are often necessary in order to initiate or enhance 

30 replication, as is the case for HCV. As such, the expression cassette 
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according to the present invention preferably also includes such 
sequences, the identity thereof can be determined by quantifying replication 
from various expression cassettes which include different segments from 
the coding region of the virus. 

Since the cap dependent translation of RNA in virally infected cells 
is oftentimes downregulated by the presence of a replicating virus, the 
expression cassette preferably also include internal ribosome entry site 
(IRES) sequences for initiation of cap independent translation of the 
chimeric polypeptide(s)if such sequences are not already included within 
the 5' and 3' sequences. 

The viral sequences included in the expression cassette according to 
the present invention, are derived from a plus strand RNA virus or a minus 
strand RNA virus such as for example a virus of the picornavirus family, a 
virus of the togavirus family, a virus of the orthomyxovirus family, a virus 
of the paramyxovirus family, a virus of the coronavirus family, a virus of 
the calicivirus family, a virus of the arenavirus family, a virus of the 
rhabdovirus family or a virus of the bunyavirus family. 

According to another preferred embodiments of the present 
invention, the RNA virus is a Hepatitis C virus (HCV). 

The nucleic acid construct described hereinabove can be constructed 
using commercially available mammalian expression vectors or derivatives 
thereof. Examples of suitable vectors include, but are not limited to, 
pcDNA3, pcDNA3.1(+/-), pZeoSV2(+/-), P SecTag2, pDisplay, 
pEF/myc/cyto, pCMV/myc/cyto, pCR3.1, which are available from 
Invitrogen, pCI which is available from Promega, pBK-RSV and 
pBK-CMV which are available from Stratagene, pTRES which is available 
from Clontech, and their derivatives and modificants. 

Any of the promoter and/or regulatory sequences included in the 
mammalian expression vectors described above can be utilized to direct the 
transcription of the expression cassettes described above. However, since 
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such vectors are readily amenable to sequence modifications via 
standard recombinant techniques, additional regulatory elements, promoter 
and/or selection markers can easily be incorporated therein if needed. 

The nucleic acid construct according to this aspect of the present 
5 invention can be utilized in a cell-based or a cell free assay to detect virus 
infection of a cell, to uncover novel anti-viral drugs or to determine the 
resistance of an RNA virus isolate to anti-viral drugs. 

When utilized in cell-based assays, the nucleic acid construct is 
introduced into a cell via any standard transformation method- Numerous 

io methods are known in the art for introducing exogenous polynucleotide 
sequences into eukaryotic cells. Such methods include, but are not limited 
to, direct polynucleotide uptake techniques, and virus or liposome mediated 
transformation (for further detail see, for example, "Methods in 
Enzymology" Vol. 1-317, Academic Press). Bombardment of cells or cell 

15 cultures. 

A genetically transformed cell including the nucleic acid construct of 
the present invention either stability integrated into it's genome, or 
transiently expressed can be utilized for a cell-based assay. In assays 
designed for uncovering novel anti-viral drugs or determining the resistance 

20 of an RNA virus isolate to anti-viral drugs, such a cell can further be 
genetically transformed to also express an RNA polymerase of a virus of 
interest along with other viral proteins and as such serve as a "test bed" for 
various molecules of interest 

Thus, the nucleic acid construct of the present invention can be 

25 utilized in a method for detecting a presence of an RNA virus in a cell by 
incubating the nucleic acid construct with an extract of cell or by 
introducing the construct into the cell and measuring the signal from the 
reporter molecule. Preferably, this signal is compared to a signal measured 
from a cell infected with a virus and possibly also a cell not infected with 

30 the virus to thereby determine the presence of the virus in the cell. 
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As mentioned hereinabove, the nucleic acid construct of the 
present invention can be utilized in an assay designed for screening 
anti-viral activities of various molecules or in an assay for determining the 
drug resistance of an RNA virus isolate. Such assays are separately 
5 effected by incubating the nucleic acid construct and a potential anti-viral 
drug when screening molecules for anti-viral activities, or a known 
anti-viral drug when determining drug resistance of an RNA virus along 
with a cellular extract from an infected cell. Alternatively the constructs 
and potential or known drug are introduced into an infected cell or a cell 

10 expressing the viral polymerase and possibly other viral components. 

Following a predetermined time period, the reporter activities are 
measured and preferably compared to those measured from cells not 
including the potential or known drug to thereby determine the anti-viral 
activity of the drug candidate or to determine the resistance of the virus to 

15 the known anti-viral drug. 

It will be appreciated that although cell-free assays (in-vitro) can be 
efficiently utilized for detennining the anti-viral activity of a drug candidate 
or for detennining the resistance of the virus to the known anti-viral drug 
cell-based assays (in-situ) screening in virally infected cells is preferred 

20 since this method determines anti-viral activity in-situ and in the presence 
of all the virally expressed components aid as such it is more accurate in 
predicting future activity of screened molecules m-vivo. 

Thus, the present invention provides nucleic acid constructs and 
methods of utilizing same to detect viruses in infected cells, to screen and 

25 uncover potential anti-viral drugs and to determine drug resistance of virus 
isolates. 

The present invention presents several advantages over prior art 
methods. It is easily to implementable and executable, and in addition 
when utilized for uncovering potential viral drugs and for drug resistance 
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screening it can provide results of an accuracy which far exceeds that 
achieved by presently available in-vitro methods. 

Additional objects, advantages, and novel features of the present 
invention will become apparent to one ordinarily skilled in the art upon 
examination of the following examples, which are not intended to be 
limiting. Additionally, each of the various embodiments and aspects of the 
present invention as delineated hereinabove and as claimed in the claims . 
section below finds experimental support in the following examples. 

EXAMPLES 

Reference is now made to the following examples, which together 
with the above descriptions, illustrate the invention in a non limiting 
fashion. 

Generally, the nomenclature used herein and the laboratory 
procedures utilized in the present invention include molecular, biochemical, 
microbiological and recombinant DNA techniques. Such techniques are 
thoroughly explained in the literature. See, for example, "Molecular 
Cloning: A laboratory Manual" Sambrook et al., (1989); "Current Protocols 
in Molecular Biology" Volumes I-m Ausubel, R. M., ed. (1994); Ausubel 
et al., "Current Protocols in Molecular Biology", John Wiley and Sons, 
Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular 
Cloning", John Wiley & Sons, New York (1988); Watson et al., 
"Recombinant DNA", Scientific American Books, New York; Birren et al. 
(eds) "Genome Analysis: A Laboratory Manual Series", Vols. 1-4, Cold 
Spring Harbor Laboratory Press, New York (1998); methodologies as set 
forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 
5,272,057; "Cell Biology: A Laboratory Handbook", Volumes I-JJJ Cellis, 
J. E., ed. (1994); "Current Protocols in Immunology" Volumes I-IJJ Coligan 
J. E., ed. (1994); Stites et al. (eds), "Basic and Clinical Immunology" (8th 
Edition), Appleton & Lange, Norwalk, CT (1994); Mishell and Shiigi (eds), 
"Selected Methods in Cellular Immunology", W. H. Freeman and Co., New 



) 
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York (1980); available immunoassays are extensively described in the 
patent and scientific literature, see, for example, U.S. Pat. Nos. 3,791,932; 
3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 
3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876; 
4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M. 
J., ed. (1984); "Nucleic Acid Hybridization" Hames, B. D., and Higgins S. 
J., eds. (1985); "Transcription and Translation" Hames, B. D., and Higgins 
S. J., eds. (1984); "Animal Cell Culture" Freshney, R. L, ed. (1986); 
"ImmobUized Cells and Enzymes" ERL Press, (1986); "A Practical Guide to 
Molecular Cloning" Perbal, B., (1984) and "Methods in Enzymology" Vol. 
1-317, Academic Press; "PCR Protocols: A Guide To Methods And 
Applications", Academic Press, San Diego, CA (1990); Marshak et al, 
"Strategies for Protein Purification and Characterization - A Laboratory 
Course Manual" CSHL Press (1996); all of which are incorporated by 
reference as if folly set forth herein. Other general references are provided 
throughout this document. The procedures therein are believed to be well 
known in the art and are provided for the convenience of the reader. All the 
information contained therein is incorporated herein by reference. 

EXAMPLE 1 
MATERIALS AND METHODS 
Clinical characteristics of the recipient patient: 
Sera from an individual known to be suffering both thalessemia and 
chronic hepatitis C was used for RT-PCR to obtain overlapping clones 
comprising the full length HCV genome (SEQ ID NO:33). Sera was 
collected from the patient after bone marrow transplantation upon diagnosis 
of elevated levels of serum transaminase indicative of HCV reactivation. 
The patient was determined to be HCV positive by RT-PCR of the plasma 
using the branched DNA assay with a level of 352 Meq/ml (Quantiplex 
HCV RNA assay, version 2.0 (bDNA); Chiron Diagnostics). Serum 
samples were collected in 400 ul aliquots and stored at -80 °C. 
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Isolation of HCVRNA: 

RNA was extracted from 400 ul of sera using 1.2 ml of the Trizol LS 
reagent (Gibco BRL, Gaithesburg MD, USA). The mixture was inverted 
for 20 seconds at room temperature (RT), 0.35 ml of chloroform was added 
5 and me mixture inverted again for 20 seconds. The mixture was allowed to 
stand at RT for 5 minutes following which it was centrifuged at 12 000 rpm 
for 20 minutes. The upper phase of the mixture was transferred to a new 
microfuge tube, 0.8 ml of isopropanol and mixing was effected via 
inversion. The tube was left at RT for 5 minutes following which it was 
10 spun again at 12 000 rpm for 20 minutes at 4 °C. The RNA pellet was 
air-dried and re-suspended in 50 ml of DEPC-treated water. 

RT-PCR: 

Several RT-PCR reactions were conducted in order to obtain the 
various overlapping cDNA fiagments. The various RT-PCR utilized are 

is listed in Table 1. The RNA, extracted as described above, was reverse 
transcribed at 42 °C for 1 hour using 100 ng of oligo(dT) and/or specific 
antisense primers and 200 U of Superscript n polymerase (Gibco BRL, 
Gaithersburg). The resultant cDNA samples were heated at 70 °C for 15 
minutes and PCR amplified using the Expand High Fidelity PCR System 

20 (Boehringer Mannheim). The PCR reactions were performed with 2-5 pi of 
template in a total volume of 50 pi. Different cycling profiles were used 
depending on the target length and the melting temperature (Tm) of the 
primers. Generally the PCR conditions were as follows: a hot-start at 95 °C 
for 3 min, denaturation at 95 °C for 1 min, annealing at 45-65 °C for 1 min, 

25 and extension at 68 °C for 1 min per 1 kb of amplified cDNA. At the end of 
30-35 cycles, a final extension was carried out at 68 °C for 8 minutes. In 
several cases nested PCR was carried out to obtain the HCV cDNA 
fragment (Table 1). 
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5> RACE: 

To clone the 5' UTR of HCV, a 5' rapid amplification of cDNA ends 
method using the 573' RACE kit from Boehringer Mannheim was 
employed. The first strand cDNA was synthesized with the antisense 
5 primer H3 (Table 1) and AM reverse transcriptase at 55 °C for 1 hour and 
the resultant cDNA was purified using the High Pure PCR Product 
Purification kit (Boehringer Mannheim). A terminal transferase was 
utilized for 3' dA-tailing of the purified cDNA sample following which the 
transferase was heat-inactivated at 70 °C for 10 minutes. The tailed cDNA 

10 was amplified using the oligo dT-anchor primer and the H29 and the gene 
specific H4 primers (Table 1) utilizing the Expand High Fidelity PCR 
system. PCR conditions were as follows: 95 °C for 3 minutes, followed by 
35 cycles of 95 °C for 1 minute, 45 °C for 1 minute, 68 °C for 1 minute, and 
a final extension at 68 °C for 8 minutes. A second round of PCR was 

15 performed with 1 ml of the first reaction mixture and me PCR anchor 
primer and the H30 and H5 primers (Table 1). The PCR products were 
cloned into the pCRII TOPO plasmid using the TOPO TA cloning kit from 
Clontech (Carlsbad, CA, USA). 

Construction of HCV-S1 cDNA clones encoding the structural 

20 proteins: 

The region spanning the 5' non-coding region (NCR) including the 
p7 region (nucleotides -276 to 2461 in Figure 1A) was PCR amplified using 
clones C and D as templates and primers H2 and H12 (Table 1). The 
resulting 2.7 kb PCR product (nucleotides 65-2802 of SEQ ID NO:33) and 

25 a 600 bp PCR product comprising the NS2 cDNA (nucleotides 2769-3369 
of SEQ ID NO:33) were used as templates for the H2 and H32 primers in a 
second round of PCR amplification (Table 1) to produce a 3.3 kb DNA 
fragment (nucleotides 65-31 14 of SEQ ID NO:33). This PCR product and 
clone A were used as templates in a third round of PCR with primers H30 

30 and H32. The resultant PCR product (nucleotides 1-3114 of SEQ ID NO: 
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33) was cloned into pXL TOPO TA vector from Clontech (Carlsbad, 
CA, USA) to generate clone J (Figure 1A). The truncated NS2 PCR 
product was amplified from clone E (Figure 1 A) using Ihe primers H3 1 and 
H32. The PCR conditions were as follows: hot-start at 95 °C for 3 min, 

5 denaturation at 95 °C for 1 minute, annealing at 60-65 °C for 1 minute, and 
extension at 68 °C for 1 minute per 1 kb of amplified cDNA. At the end of 
30 cycles, a final extension step was carried out at 68 °C for 8 minutes. 
Clone J was digested with EcoRI and re-cloned into pcDNA3.1(+) 
(Invitrogen) and pXJ41neo (Gift from C. Pallen, IMCB, 20) and correctly 

10 oriented clones were selected. 

Table 1 - Sequences of primers used for PCR amplification of overlapping 



cDNA regions of the genome of HCV isolate HCVS1. 



Primer 


Sequence (S*-3*) 


Position 


Sense/Anti* 
ease 


Reference 


HI 


ACTGTCTTCACGCAGAAAGCGTCTAGC 
CAT (SEQ ID NO: 1) 


-285 to -256 


sense 


Bukfa et at 
1992 


H2 


CACGCAGAAAGCGTCTAGCCAT (SEQ 
IDN02) 


-276 to -247 


sense 


Bukb et al. 
1992 


H3 


cgagacctcccckkkx:actcgcaagca 
ccc(seqidn0-3) 


-14 to -43 


anti-sense 


Bukfa et al. 
1992 


H4 


TCCCGGGGCACTCGCAAGCACCCTATC 
AGG (SEQ ID NO:4) 


-2110-50 


anti-sense 


Bukfa et al 
1992 


H5 


CTATCAGGC^GTACCACAAGGCCTTTC 
GCG(SEQJDN05) 


-4310-72 


anti-sense 




H6 


CCCGCY AGGACYCCCCAGTGG (SEQ JD 
NO.6) 


1073 to 1053 


anti-sense 


Dirsel et aL 
1994* 


H7 


GCCCAGTTCCXXACCATGGA (SEQ ID 
NO:7) 


1106 to 1087 


antt-sense 


Dirsel et aL 
1994* 


H8 


AGGGCAGTCCTGTTGATGTGC (SEQ ID 
NO:8) 


1280 to 1260 


anti-sense 


Dirse] et al 
1994* 


H9 


AGGCTATCATTGCAGTTCAGGGC (SEQ 
IDN09) 


1298 to 1276 


anti-sense 




H10 


CCACTGGGGRGTCCTRGCGGG (SEQ ID 
NO:10) 


1053 to 1073 


sense 


Dirsel et al 
1994* 


Hll 


TCCATGGTGGGGAACTGGGC (SEQ ID 
NO:ll) 


1087 to 1106 


sense 


Dirsel et al 
1994* 


H12 


CXKXKXXK^CXIATGCAGCCAT (SEQ ID j 


2461 to 2440 


anti*sense 




H13 


AGTACCAGACCTATGAAAACCGC (SEQ 
IDNai3) 


2483 to 2461 


anti-sense 
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H14 


ATGGACCGGGAGATGGCTGCA (SEQ ID 
NO:14) 


2428 to 2448 


sense 




H15 


AGGCTTTAGCCGTGTGAGACA (SEQ ID 
N0:15) 


4848 to 4828 


anti-sense 




H16 


GCGCCY ATCACGGCCTACTCC (SEQ ID 
NO.16) 


3079 to 3099 


sense 


- 


H17 


GACGACCTCCAGGTCAGCCGA (SEQ ID 
N0:17) 


4968 to 4948 


anti-sense 


- 


H18 


ACGCCCACTTCTTGTCTCAGA (SEQ ID 
NO:18) 


4706 to 4726 


sense 


- 


H19 


ACTAAGCAGGCAGGAGACAAC (SEQ ID 
NO:I9) 


4726 to 4746 


sense 


- 


H20 


TTGATGGGTAATTTGCTCTCC (SEQ ID 
NO20) 


7328 to 7308 


anti-sense 


- 


H21 


GTGGTGACGCAGCAAGGAGTT (SEQ ID 
N021) 


7359 to 7339 


anti-sense 


- 


H22 


CAGCGACGGGTCITGGTCTAC (SEQ ID 
NCh22) 


7200 to 7220 


sense 




H23 


TCACCGGTTGGGGAGCAGATAG (SEQ 
IDNO:23) 


9033 to 9012 


anti-sense 


- 


H24 


TCTACGGGGCCTACTACTCCATr (SEQ 
IDN024) 


8597 to 8619 


sense 


- 


H25 


CTACTACTCCATTGAGCCACTTGAC 
(SEQIDN025) 


8607 to 8631 


sense 




H26 


ACATGATCTGCAGAGAGGCCAGTATCA 
GCACTCTC (SEQ IDND26) 


9269 to 9234 


anti-sense 


Tanaka et al. 
1996 


H27 


GTCAAGTGGCTCAATGGAGTAGTAGGC 
(SEQIDN027) 


8631 to 8605 


anti-sense 


- 


H28 


GCCAGCXCXXGATTGGGGGCGACACTC 
CACCATAGATC^CTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGGGTCTA 
GCCA(SEQIDN028) 


-34Ito-256 


sense 


- 

• 


H29 


GACCACGCGTATXXjATGTCXjACTTTT^ 
JiiiJIlinV (SEQ IDN029) 




sense 


573* RACE kit 


H30 


GACXACGCGTATCGATGTCXjAC (SEQ ID 
NO30) 




sense 


573' RACE kit 


H31 


ATGGACCAGGAGTTGGCTGCATCGTGC 
(SEQIDNOSl) 


2769to-2796 


sense 




H32 


CTAACGCGCACGCACGAATGAGGCCTT 
(SEQIDN032) 


3115 to 3008 


mti-5ense 





Construction of HCV-S1 cDNA clones encoding the NS proteins: 

The region spanning NS3 to NS5A (nucleotides 3420-7669 of SEQ 
ID NO:33) was obtained by double-cloning a 1.844 kb BamHI/Bmrl 
fragment (nucleotides 3420-5263 of SEQ ID NO:33) from clone F (Figure 
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1A) and a 2.4 kb Bmrl/EcoRV fragment (nucleotides 5263-7669 
of SEQ ID NO:33) from clone G (Figure 1A) into pKSB (+/-) digested with 
BamHI and EcoRV. The resulting clone was digested with Xbal and BsrGI 
and ligated to a 0.9 kb Xbal/BsrGI fragment (nucleotides 2769-3640 of 
5 SEQ ID NO:33) containing the NS2 ORF from clone E, to thereby produce 
clone K (Figure 1A). To generate the region spanning nucleotides 7200 to 
9268 of the HCV genome, clones H and I (Figure 1A) were used as 
templates in a PCR reaction with primers H22 and H26 (Table 1). The 
resultant PCR product (nucleotides 7641-9609 of SEQ ID NO:33) was 

10 cloned into pCPJITOPO to generate clone L (Figure 1A). Clones K and L 
were each introduced into electro-competent GM109 bacteria cells and 
DNA plasmids preparations of these clones were digested with Bell and 
EcoRV and co-ligated to generate clone M (Figure 1A). Clone M was 
digested with NotI and Xhol and re-cloned into pcDNA3.1(+) and 

is pXJ41neo to generate pcDNA3(NSP) and pXJ41(NSP) respectively. 
Construction of full-length cDNA clones of HCV-S1: 
Clones J and M were digested with Cspl and Xbal and the resulting 
3.3 kb fragment from clone J (nucleotides 1-3369 of SEQ ID NO:33) 
including the anchor-5'NCR to NS2 sequence was ligated into clone M to 

20 generate a full length genome of HCV-S1 in pKSH(+/-) (designated 
pKSH(Sl)). To generate the full length clone in pcDNA3.1(+), the 
EcoRV/BsrGI fragment from pKSH(Sl) was ligated to the pcDNA3(NSP) 
digested with the same enzymes to generate pcDNA3(Sl). The same 
fragment was cloned into the blunt-Notl/Bsrgl site in pXJ41(NSP) to 

25 generate pXJ41 (SI). 

Renilla luciferase expression construct: 

The renilla luciferase cDNA (GeneBank Accession number M63501, 
nucleotide coordinates 10-945) including the upstream intron sequence 
from human growth hormone (GeneBank Accession number Ml 343 8, 
30 nucleotide coordinates 569-827) was PCR amplified from pBlND 



WO 02/08447 PCT/IL01/00669 

26 

(Promega) and subcloned into the Hindm site of pcDNA3.1(+). 
Clones containing the insert in the right orientation were isolated and 
verified by sequence analysis. 

Chimeric HCV-Iuciferase constructs: 
5 The firefly luciferase gene (GeneBank Accession number Ml 5077, 

nucleotide coordinates 253-2387) was PCR amplified from the plasmid 
pGL3-Basic (Promega, Madison, WI). The PCR product was digested with 
EcoRI and EcoRV and re-cloned into pcDNA3.1(+) (Clontech) to generate 
the construct pLUCEE(15). The HCV sequence from nt 1- 374 comprising 

10 the full length 5'NCR and the first 33 nt of its core sequence (nucleotides 
1-374 of SEQ ID NO:33) was PCR amplified from HCV-S1. The PCR 
product was digested with Hindlll and EcoRI and cloned into pLUCEE15 
to generate the construct pLUCEE15NC(B2). In order to clone the entire 
3'UTR of HCV-S1 downstream of pLUCEE15NC(B2), the plasmid 

15 pHCV700(A8) (clone I, Figure 1A) was digested with Xcml and EcoRV 
and blunted with Klenow. The resultant insert was cloned into the EcoRV 
site of pLUCEE15NC(B2) and clones with the 3'UTR cloned in the right 
orientation were isolated. One of these clones pLUQ^C3UTR(B9) was 
excised with Hindm and Xhol, blunted with Klenow and cloned into the 

20 EcoRV site of pcDNA3.1(+). Clones with inserts in the anti-sense 
orientation were isolated and designated pAS9 (Figure 1 A). Next, chimeric 
HCV-luciferase constructs which contained HCV NS5B and 3'UTR 
sequences were generated. A region covering the C-tenninal end of the 
NS5B sequence and the complete 3'UTR of HCV-S1 was PCR amplified 

25 from pHCV700(A8) (clone I, Figure A). The PCR product (nucleotides 
9159-9609 of SEQ ID NO:33) was digested with EcoRV and Xhol and 
cloned into pLUCEE15NC(B2) to generate pLUCNC5BUTR(ll). The 
insert from this construct was excised with Hindm and Xhol, blunted with 
Klenow and cloned into the EcoRV site of pcDNA3.1(+). Clones with 

30 inserts in the anti-sense orientation were isolated and named pASl 1 (Figure 
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1A). All constructs were verified via enzymatic restriction digestions 
and sequence analyses. Figure IB illustrates the above described steps 
utilized in generating the chimeric anti-sense expression constructs pAS9 
and pASl 1 and their sense oriented counterparts. 
5 Sequence analysis: 

DNA sequencing of all constructs was carried out using the Taq 
DyeDeoxy terminator cycle sequencing kit and an automated DNA 
sequencer 373 from PE Applied Biosystems (Foster City, CA, USA). 
Cells and cell culture: 
10 The human embryonic kidney cell line, 293, its derivative, 293T, 

which bears the large T antigen from SV40, and the human hepatoma cell 
line HuH-7 were all purchased from American Type Cell Collection 
(ATCC). The cells were cultured in Dulbecco's Minimal Essential Media 
(DMEM) containing 2 mM L-glutamine, and 10% fetal bovine serum and 
15 maintained at 37 °C in 5 %C0 2 . 
Cell transfections: 

Transfections were performed using the Effectene™ transfection 
reagent from QIAGEN (Valencia, CA, USA). Approximately 2 x 10 5 cells 
were plated into 6-well tissue culture plates 14-18 hours prior to 

20 transfection. A total of 1 pg of plasmid DNA in 150 pi EC buflfer was 
mixed with 8 \il of enhancer and vortexed for 1 0 seconds. The mixture was 
allowed to stand at RT for 2-5 minutes, 25 ^1 of Effectene™ transfection 
reagent was added, the mixture vortexed again and incubated at RT for 
another 5-10 minutes. Cells were washed with PBS, added into 

25 DNA-Effectene™ mixture diluted in 2 ml of complete growth medium and 
incubated at 37°C and 5% C0 2 for 6-8 hours. Following incubation, the 
medium was removed and the cells were washed with once with PBS. 
Approximately 2.5 ml of fresh complete medium was added to the cells and 
the cells wore incubated for an aditional 48-120 hours, following which 
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cells were harvested for RNA isolation or western analysis, or treated with 
1000 mg/ml G41 8 for selection of stable clones. 
Luciferase assays: 

Luciferase activity was measured using the a luciferase assay kit 
(Promega, Madison, WI). Following a 72-120 hour incubation period, cells 
were washed twice with PBS and lysed with 100 pi reporter lysis buffer 
(Promega). The lysate was allowed to stand at room temperature for 10-15 
minutes. Following which, the lysate was centrifuged for 1 min in a 
microfuge and a 10 pi aliquot was mixed with 100 pi of reporter buffer 
(Promega); luciferase activity was measured in a Turner luminometer 
(Turner Designs, Sunnydale, CA) over an integration period of 15 seconds. 
In. cells co-transfected with pCMV-Ren, cell pellets were re-suspended in 
100 ml of passive lysis buffer and measured using the dual-luciferase 
system from Promega. Values obtained were normalized with the levels of 
Renilla luciferase activity in the cell lysates and the total protein 
concentration. 

In-vitro translation: 

Translation was effected via the TNT quick coupled 
transcription/translation system from Promega. Briefly, 0.5-1 pg of 
plasmid DNA was mixed with 40 pi of TNT quick master mix and 2 pi of 
35 S methionine (lOmCi/ml) (NEN). The reaction mixture was incubated at 
30 °C for 1-3 hours. Following a predetermined time period, an aliquot was 
removed and SDS-Page analysis was performed. Where indicated, between 
0.3-2.5 pi of canine pancreatic microsomal membranes (Promega) were 
added to the reaction mixture. 

Western blot analysis: 

Cell lysates were resolved on a 10 or 12% sodium dodecyl sulphate 
(SDS>polyacrylamide gel, transferred to a nitrocellulose membrane, 
blocked with 5% nonfat skim milk in PBS, and incubated with a primary 
antibody followed by incubation with anti-mouse or anti-human secondary 
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antibody conjugated to horseradish peroxidase (Sigma). Detection was 
effected using the ECL enhanced chemihuninescence kit (Pierce). The E2 
directed antibody (H52), was a kind gift from J. Dubuisson (Institut de 
Biologie de Lille & Institut Pasteur de Lille, Lille Cedex, France). The NS3 
5 and NS5A directed monoclonal antibodies were purchased from Devaron, 
Inc. (NJ, USA) and Biodesign International (ME, USA) respectively. 

EXPERIMENTAL RESULTS 
Generation of HCV overlapping cDNA clones: 
Sera derived from a single chronic HCV carrier were subjected to 

10 RT-PCR, and nine overlapping cDNAs clones covering the entire HCV 
genome were (Figure 1A). The overlapping regions in these clones had 
almost identical sequences (data not shown). To obtain the complete 5' 
NCR sequence of this isolate, 5' rapid amplification of cDNA ends was 
effected using the 573' RACE kit from Boehringer Mannheim. Following 

15 two rounds of nested PCR, a cDNA fragment comprising the 5' NCR 
region spanning nucleotides -341 to -72 that was missing from clones B 
and C was obtained. The overlapping cDNA clones of isolate HCV-S1 
span 9609 nucleotides encoding a complete polyprotein 3010 amino acids 
long (SEQ ID NO:34), and a 341-nt 5' NCR, and a 235-nt 3' NCR (Figure 

20 1A). To determine the genotype of isolate HCV-S1, the sequence of a 
region of 226 nt within the 5' NCR (from -276 to -21, Figure 1A) (2) as 
well as 233 nt within NS3 (from 4699 to 4932, Figure 1A) and 400 nt 
within NS5B (from 7904 to 8304, Figure 1A) (5) were analyzed. Following 
comparison to available HCV sequences, it was determined that HCV-S1 

25 belongs to the type 1 genotype, with a lb subtype. Sequence comparisons 
of the other two regions were consistent with this finding. 
Characterization of full length HCV genome: 
The full length HCV genome was generated as described 
hereinabove to produce pcDNA3(Sl) and pXJ41(Sl) respectively. To 

30 characterize, this clone, in vitro coupled transcription and translation was 
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first carried out with pcDNA3(SP) and pcDNA3(NSP) using a kit from 
Promega. A single polyprotein larger than 185kD was observed following 
one hour of incubation with pcDNA3(NSP) (Figure 2A, lane 2). Prolonged 
incubation periods gave rise to smaller protein products (Figure 2A, lanes 
4-6). Following two hours of incubation, distinct bands corresponding to 
proteins of approximately 80, 75 and 62 kD in size were also detected 
(Figure 2A, lane 6). It is believed that these products are the result of the 
enzymatic activity of the protease moiety of NS3 and as such these bands 
possibly correspond to NS3-4A (77kD), NS5B (68kD) andNSSA (58kD). 

The construct pcDNA3(SP) contains the entire HCV sequence of the 
core, El and E2 proteins, and the first 115 amino acids of NS2 and as such 
when translated should give rise to a polyprotein of about 82kD. In vitro 
translation experiments with this construct with addition of either an 
enhancer or KC1 produced a single band corresponding to about 82kD 
(Figure 2B, lanes 1-6), whilst addition of magnesium acetate failed to 
produce any band (Figure 2B, lanes 7-9). 

The above described was repeated with the pcDNA3(Sl) construct. 
Following a one hour incubation, a broad band larger than 185kD was 
observed (Figure 2C, lane 1). Following two hours of incubation, several 
smaller bands were observed of sizes ranging from 65 to 140kD. In 
addition, two fainter bands of 60kD and 50kD were also detected (Figure 
2C, lane 2). The intensity of the bands increased slightly when incubation 
was allowed to proceed for three hours (Figure 2C, lane 3). This suggests 
that the HCV polyprotein was proteolytically cleaved in vitro, mostly likely 
by the NS3 protease. Interestingly addition of canine pancreatic 
microsomal membranes (CPMM) led to disappearance of the two upper 
bands of about 140 and 100 kD and reduction in intensity of the lower two 
bands (Figure 2C, lanes 4 and 5). It is likely that these bands represent 
subfragments of the HCV polypeptide and were post-translationally 
processed by the microsomal vesicles. pXJ41(Sl) was transiently 
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transfected into 293T cells, and the egression of HCV proteins was 
examined. Structural (core and E2) and non-structural (NS3, NS5A, NS5B) 
proteins (Figure 3A-G) were detected using available monoclonal or 
polyclonal antibodies. 

These results indicate that the full length HCV genome cloned while 
reducing the present invention to practice, is able to direct the expression of 
the full length polyprotein and is capable of being processed. 

Results of transfection of anti-sense chimeric HCV-Iuciferase 
construct pASB9: 

The 293T and HuH7 cell lines were separately transfected with two 
different clones of pASB9 (pASB9.1 and pASB9.2), which contain an 
anti-sense chimera of the firefly lucif erase gene downstream of a HCV 5* 
NCR-core sequence and upstream of the HCV 3' UTR sequence. 
Transfection was carried out with pASB9 and an equal amount of 
pXJ41(NSP), pXJ41(Sl) or a combination of pXJ41(NS3) and 
pXJ41(NS5B). Co-transfection with the vector, pXJ41neo was used as a 
control to measure background luciferase activity. The cells were harvested 
and assayed for luciferase activity 5 days post-transfection. There was no 
observed increase in luciferase activity in co-transfection experiments with 
any of the HCV expression constructs compared to co-transfection with the 
vector (data not shown). Experiments carried out with a total of 1 or 2 ? g 
of DNA produced similar results. 

Results of transfection of anti-sense chimeric HCV-luciferase 
construct pASll: 

Similar experiments were carried with the anti-sense construct 
pASll (pASll-12 and pASll-15) which contains the anti-sense chimera 
of the firefly luciferase gene downstream of a HCV 5' NCR-core sequence 
and upstream of the HCV NS5B-3' UTR sequence. In 293T cells, 
co-transfection with the full length HCV expression plasmid, pXJ41(Sl) 
and pASll-12 produced a 10-fold increase over background luciferase 
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activities five days post transfection, while a 14.7-fold increase was 
observed with pASll-15 co-transfected with pXJ41(Sl) (Table 2). In 
similarly transfected HuH7 cells, luciferase activities were 2.7-fold and 
5.8-fold above background values three days post transfection (Table 3). At 

5 five days post transfection, the luciferase activities in HuH7 cells slightly 
increased to 3.7-fold and 6.2-fold respectively (Table 4). However, 
co-transfection of pASll-12 or -15 with the NS proteins expression vector 
(pXJ41(NSP)) or the vector including NS3 and NS5B, resulted in no 
detectable increase in luciferase activity as compared to transfection with 

10 vector alone (Tables 2-4). 



Table 2 





Constructs 


Rl 


R2 


Av 




1 


pASn(12HpXJ41nco 


1.20 


1.06 


1.13 




2 


pASll(12>H>XJ41(NS3)+pXJ41(NS5B) 


\J2 


1.46 


134 




3 


pASll(12>H>XJ41(NSF) 


1.40 


130 


135 




4 


pASll(12>fpXJ41(Sl) 


1234 


11.11 


11.73 


1036X 














5 


pASll(15)*vector 


139 


1.46 


1.42 




6 


pASll(15>+iiXWl(NS3>+T>XJ41(NS5B) 


152 


153 


1.73 




7 


pASll(15>H>XMl(NSP) 


131 


133 


132 




8 


pASll(15>*pXJ41(Sl) 


21.79 


20.09 


2054 


14.7X 














9 


pll(3)+pXMlneo 


5106.2 


5082.7 


50945 




10 


pll(6>pXJ41neo 


56115 


5440.0 


5525.8 




11 


pXMlneo 


0 


-0.2 


0 

















pi 1(3) -sense, done II 
. pi 1(6) -sense, clone pi 1 
15 R - luciferase reading 

Av - average luciferase reading 



Table 3 





Constructs 


Rl 


R2 


Av 




1 


pAS1102>fpXJ41neo 


552 


5.27 


5.40 




2 


pASl 102>hiXJ41(NS3)+pXJ41(NS5B> 


3.10 


258 


3.04 
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3 


pASll(12)+pXJ41(NSP) 


4.11 


4.12 


4.11 




4 


pASll(12>fpXJ41(Sl) 


15.21 


14.26 


14.74 


2.73X 














5 


pASll{l5>Wtctor 


3.25 


3.70 


3.47 




6 


pASll(15>H>XJ41(NS3>fpXJ41(NS5B) 


3.15 


3.23 


3.19 




7 


pASll(15>+pXJ41(NSP) 


3.24 


3.14 


3.19 




8 


pASll(15>fpXJ41(Sl) 


19.87 


20.07 


19.97 


5.76X 














9 


pll(3>fpXJ41neo 


10348.8 


9848.0 


10098.4 




10 


pll(6}fpXJ41neo 


13418.2 


12821.6 


13119.9 




11 


pXMlneo 


0 


0 


0 

















pll(3)- sense, clone 11 
pi 1(6) - sense, done pi 1 
R - hjciferase leading 
5 Av - average hiciferase reading 

Table 4 





Constructs 


Rl 


R2 


Av 




1 


pASll(12)*pXJ41neo 


2.03 


2.09 


2.06 




2 


pASl l(12)+i>XJ41{NS3>fpXJ41(NS5B) 


0.82 


0.66 


0.74 




3 


pASll(12>+pXJ41(NSP) 


136 


1.09 


\32 




4 


pASHC12>*pXJ41(Sl) 


8.26 


658 


7.62 


3.7X 














5 


pASll(15>Wectar 


Z19 


2.02 


2.10 




6 


pASl U15>fpXJ41(NS3>fDXJ41(NS5B) 


052 


0.79 


0.86 




7 


pASll(l5>*pXJ41(NSP) 


0.63 


0.61 


0.62 




8 i 


pASll(15>H>XJ41(SI) 


13.89 


1Z28 


13.09 


6.23X 














9 


pll(3>fpXJ41neo 


3565.8 


34503 


3508.1 




10 


pll(6>fpXJ41neo 


4197.2 


38615 


4030 




11 


pXMlneo 


0 


0 


0 

















pll(3)- sense, clone 11 
pll(6) - sense, done pi 1 
10 R - hiciferase reading 

Av - average hiciferase reading 
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Results of co-transfection with pASll and pCMV-Ren: 

Similar experiments were conducted using a renilla expression 
construct pCMV-Ren, in order to account for any variation in luciferase 
activity due to different transfection efficiencies. 293T and HuH7 cells were 
transfected with a total of 1 ug of DNA and cells were harvested and 
analyzed 3 days post-transfection. All values obtained were normalized 
against total protein concentration and renilla luciferase activity. In 293T 
cells, co-transfection of pASll-15 with pXJ41(Sl) resulted in a 18.5-fold 
increase over background luciferase activity (Table 5). In HuH7 cells, the 
luciferase activity of pASll-15 was 3.9-fold higher when co-transfected 
withpXJ41(Sl)(Table6). 



Table 5 







SXdflution 


5X dilution 




5X dilution 


Neat 






Constructs 


AvFFLUC 


AvREN 


N_Ren(X) 


N.AvFFL 


Av 
FFL 


Total 
FFL 


1 


P ASll(12>fpXJ4lDco 


0J9 


1491 


1.00 


039 


8.56 


8.56 


2 


pASll(12>+pXJ41(Sl) 


0.72 


478 


3.12 


225 


145.80 


158.146 


3 • 


ASHOy^pXMlneo 


37.13 


1626 


022 


34.16 


9538.0 
0 


11445.6 




















































Constructs 


ProtConc 


PN(X) 


Total FFL 


Final FFL 






1 


pAS11(12>+tiXJ41dco 


0303 


1 


8.56 








2 ! 


pASlin2HiiXJ41(Sl) 


0.284 


1.07 


145.80 


158.15 


18.48X 




3 


ASll(3>+l>XJ41neo 


0.253 j 


\2 


9538.00 


11445.60 























Av FF LUC = average of 2 firefly luciferase readmgs fi^ 20 lnl <>f 5X dflutrf ccfl lysate 
Av REN" average of 2 renilla luciferase readings from 20ml of 5X diluted cell rysate 
N. Ren (X)= normalisation index of renilla luciferase readings from 20 ml of 5X diluted cell rysate 
N. Av FFL^ average of 2 firefly luciferase reading from 20 ml of 5X diluted cell rysate after norrnalisation against renilla 



luciferase index 

Av FF1> normalised average of 2 firefly tociferase readings from 20 ml of eel] rysate (neat) 
Fmal Total FFU= normalised average of 2 firefly luciferase readings of total cell rysate 
PN (X>= protein normalisation index 

Final FFLf final average of 2 firefly luciferase readings of total ccH rysate 



WO 02/08447 



PCT/IL01/00669 



35 

Table 6 







5X dilution 


5X dilution 




5X dilution 


Neat 


Neat 




Constructs 


Av FF LUC 


AvREN 


N.Ren(X) 


N.AvFFL 


AvFFL 


Total 
FFL 


1 






1491 


1.00 


039 


197 


7.86 


*> 




A *?"> 

0.72 


478 


3.12 


125 


1123 


44.92 


3 


A2>1 l(3/rpXJ4 Jneo 


37.13 


1626 


0.92 


34.16 


17.08 


68320 




















































Constructs 


Plot Cone 


PN(X) 


Total FFL 


Final FFL 






1 


pASll(12>*pXJ41neo 


0.063 


1 


7.86 


7.86 






2 


pASU(12)4pXJ41(Sn 


0.093 


0.68 


44.92 


30.55 


3.88X 




3 


ASll(3>fpXJ41nco 


0.106 


0.59 


683.20 


403.09 























Av FF LUC — average of 2 firefly luciferase readings from 20 ml of 5X diluted cell iysate 
Av REN » average of 2 renilla luciferase readings from 20 ml of 5X diluted cell rysate 
N. Ren PQ = normalisation index of renilla luciferase readings from 20 ml of 5X diluted cdl Iysate 



5 N. Av FFL" average of 2 firefly luciferase readings from 20 ml of 5X diluted cell Iysate after normalisation against renilla 
luciferase index 

Av FFL=* normalised average of 2 firefly luciferase readings from 20 ml of cell Iysate (neat) 
Final Total FFL= normalised average of 2 firefly luciferase readings of total cell Iysate 
PN (X>= protein normalisation index 
10 Final FFLf= final average of 2 firefly mcrterase readings of total cell Iysate 

Cells co-transfected with pASB9 with different HCV expression 
constructs failed to produce changes in luciferase activity (data not shown). 
However, pASl 1 consistently produced increased luciferase activities when 

15 co-transfected with pXJ41(Sl), which expresses the foil length HCV 
genome. In 293T cells, the levels were between 10.4-14.7 folds above 
background levels, and in HuH7 cells they were between 2.7-6.2 folds 
(Tables 2-4). Even after normalizing with co-transfection with a plasmid 
that expresses renilla luciferase, a significant increase in luciferase activities 

20 was observed In 293T cells, the increase was 17-fold above background, 
while in HuH7 cells, it was 3.9-fold (Tables 5 and 6). These results indicate 
that the additional C-terminal NS5B coding sequence present only in 
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pASll is important and necessary for theNSSB polymerase (and perhaps 
other factors) to bind efficiently and initiate reverse strand synthesis. 

Several reports have shown that in vitro provided NS5B is capable of 
binding and initiating the synthesis of sequences containing the 3' UTR 
alone (17, 18). Yet, the experiments conducted while reducing the present 
invention to practice clearly indicate that the 3' UTR alone is insufficient in 
promoting polymerase activity in vivo. As such, this is the first 
demonstration that the NS5B region works together with the 3' UTR to 
facilitate negative strand synthesis in vivo. 

Interestingly co-transfection with an expression vector for the 
non-structural proteins, pXJ41(NSP) or with expression vectors for NS3 
and NS5B did not result in any increase in luciferase activity when 
compared to co-transfection with the vector alone. This suggests that the 
synthesis of the sense strand HCV-luciferase chimeric RNA by the HCV 
NS5B polymerase is dependent on multiple viral proteins, including both 
non-structural and viral protein(s). It also indicates that a foil length 
replication-competent HCV genome is required for this assay to be 
functional. 

This is the first demonstration that negative strand synthesis depends 
on expression of essentially all the viral proteins in intact cells. Based on 
these findings, the present invention provides a cell-based HCV 
replication-dependent system that is a measure of the activity of the 
full-length HCV genome. This system is simple, and robust and highly 
reproducible and in addition, enables to measure viral activity as early as 
three days post-transfection. 

Although the invention has been described in conjunction with 
specific embodiments thereof, it is evident that many alternatives, 
modifications and variations will be apparent to those skilled in the art. 
Accordingly, it is intended to embrace all such alternatives, modifications 
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and variations that fall within the spirit and broad scope of the appended 
claims. All publications, patents, patent applications and sequences 
disclosed therein and/or identified by a GeneBank accession number 
mentioned in this specification are herein incorporated in their entirety by 
reference into the specification, to the same extent as if each individual 
publication, patent, patent application or sequence was specifically and 
individually indicated to be incorporated herein by reference. In addition, 
citation or identification of any reference in this application shall not be 
construed as an admission that such reference is available as prior art to the 
present invention. 
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WHAT IS CLAIMED IS: 

1 . A nucleic acid construct comprising: 

(a) an expression cassette including: 

(i) a first polynucleotide region including a 5' NCR 
sequence of an RNA virus and at least an N-terminal 
portion of a coding sequence of said RNA virus; 

(ii) a second polynucleotide region including a 3' UTR 
sequence of said RNA virus and at least a C-tenninal 
portion of a coding sequence of said virus; and 

(iii) a third polynucleotide region encoding a reporter 
molecule, said third polynucleotide region being 
flanked by said first and said second polynucleotide 
regions; and 

(b) a promoter sequence being operatively linked to said 
expression cassette in a manner so as to enable a transcription 
of a minus strand RNA molecule from said expression 
cassette. 

2. The nucleic acid construct of claim 1, wherein at least a 
portion of said first polynucleotide region is at least 50 % identical to a 
sequence encompassed by nucleotides 1-374 of SEQ ID NO:33. 

3. The nucleic acid construct of claim 1, wherein at least a 
portion of said second polynucleotide region is at least 50 % identical to a 
sequence encompassed by nucleotides 9158-9609 of SEQIDNO:33. 

4. The nucleic acid construct of claim 1, wherein said first 
polynucleotide region further includes a 5' UTR sequence of said RNA 
virus. 
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5. The nucleic acid construct of claim 1, wherein said 
C-terminal portion of said coding sequence of said virus includes coding 
sequences of a polymerase of said virus. 

6. The nucleic acid construct of claim 1, wherein said first 
polynucleotide region includes an IRES sequence. 

7. The nucleic acid construct of claim 1, wherein said RNA 
virus is selected from the group consisting of a positive strand RNA virus 
and a negative strand RNA virus. 

8. The nucleic acid construct of claim 1, wherein said RNA 
virus is selected from the group consisting of a virus of the picornavirus 
family, a virus of the togavirus family, a virus of the orthomyxovirus 
family, a virus of the paramyxovirus family, a virus of the coronavirus 
family, a virus of the calicivirus family, a virus of the arenavirus family, a 
virus of the Aabdovirus family and a virus of the bunyavirus family. 

9. The nucleic acid construct of claim 1, wherein said RNA 
virus is Hepatitis C. 

10. The nucleic acid construct of claim 1, wherein said first and 
said second polynucleotide regions are selected such that said minus strand 
RNA molecule transcribable from said expression cassette is replicatable by 
an RNA dependent RNA polymerase of said virus into a plus strand RNA 
molecule. 

11. The nucleic acid construct of claim 1, wherein said promoter 
is functional in a eukaryotic cell. 



WO 02/08447 PCT/IL01/00669 

43 

12. The nucleic acid construct of claim 1 1, wherein said 
eukaryotic cell is selected from the group consisting of an insect cell, a 
yeast cell and a mammalian cell. 

13. The nucleic acid construct of claim 1, wherein said reporter 
molecule is a polypeptide selected from the group consisting of an enzyme, 
a fluorophore, a substrate and a ligand. 

14. A genetically transformed cell comprising a nucleic acid 
construct including: 

(a) an expression cassette including: 

(i) a first polynucleotide region including a 5' NCR 
sequence of an RNA virus and at least an N-terminal 
portion of a coding sequence of said RNA virus; 

(ii) a second polynucleotide region including a 3' UTR 
sequence of said RNA virus and at least a C-tenninal 
portion of a coding sequence of said virus; and 

(iii) a third polynucleotide region encoding a reporter 
molecule, said third polynucleotide region being 
flanked by said first and said second polynucleotide 
regions; and 

(b) a promoter sequence being operatively linked to said 
expression cassette in a manner so as to enable a transcription 
of a minus strand RNA molecule from said expression 
cassette., 

15. The genetically transformed cell of claim 14, further 
comprising an additional nucleic acid construct for expressing at least an 
RNA dependent RNA polymerase of a virus, said first and said second 
polynucleotide regions being selected such that said RNA dependent RNA 
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polymerase is capable of replicating said minus strand RNA molecule 
into plus strand RNA. 

16. The genetically transformed cell of claim 14, wherein at least 
a portion of said first polynucleotide region is at least 50 % identical to a 
sequence encompassed by nucleotides 1-374 of SEQ IDNO:33. 

17. The genetically transformed cell of claim 14, wherein at least 
a portion of said second polynucleotide region is at least 50 % identical to a 
sequence encompassed by nucleotides 9158-9609 of SEQEDNO:33. 

18. A method of detecting a presence of an RNA virus in a cell, 
the method comprising the steps of: 

(a) incubating a nucleic acid construct with an extract of the cell 
under conditions suitable for transcription and translation of 
said nucleic acid construct, said nucleic acid construct 
including: 

(i) an expression cassette having: 

(one) a first polynucleotide region including a 5' NCR 
sequence of an RNA virus and at least an 
N-terminal portion of a coding sequence of said 
RNA virus; 

(two) a second polynucleotide region including a 3' 
UTR sequence of said RNA virus and at least a 
C-terminal portion of a coding sequence of said 
virus; and 

(three) a third polynucleotide region encoding a 
reporter molecule, said third polynucleotide 
region being flanked by said first and said 
second polynucleotide regions; and 
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(ii) a promoter sequence being operatively linked 
to said expression cassette in a manner so as to direct 
the transcription of a minus strand RNA molecule from 
said expression cassette when said nucleic acid 
construct is incubated with said extract, said first and 
said second polynucleotide regions being selected such 
that said minus strand RNA molecule transcribed is 
replicatable by a polymerase of the RNA virus into a 
plus strand RNA molecule; and 
(b) quantifying a level of said reporter molecule to thereby 
determine the presence of the virus in the cell. 

19. The method of claim 18, wherein said reporter molecule is a 
polypeptide translated from said plus strand RNA molecule. 

20. The method of claim 18, further comprising the step of 
comparing said level of said reporter molecule to that obtained from cells 
free of the virus. 

21. The method of claim 18, wherein at least a portion of said 
first polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 1-374 of SEQ ID NO:3S. 

22. The method of claim 18, wherein at least a portion of said 
second polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 9158-9609 of SEQIDNO:33. 

23. A method of detecting the presence of an RNA virus in a cell, 
the method comprising the steps of: 
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(a) expressing a nucleic acid construct within the cell, said 
nucleic acid construct including: 

(i) an expression cassette having: 

(one) a first polynucleotide region including a 5 1 NCR 
sequence of an RNA virus and at least an 
N-terminal portion of a coding sequence of said 
RNA virus; 

(two) a second polynucleotide region including a 3' 
UTR sequence of said RNA virus and at least a 
C-terminal portion of a coding sequence of said 
virus; and 

(three) a third polynucleotide region encoding a 
reporter molecule, said third polynucleotide 
region being flanked by said first and said 
second polynucleotide regions; and 

(ii) a promoter sequence being operatively linked to said 
expression cassette in a maimer so as to direct the 
transcription of a minus strand RNA molecule from 
said expression cassette when said nucleic acid 
construct is expressed within the cell, said first and said 
second polynucleotide regions being selected such that 
said minus strand RNA molecule transcribed is 
replicatable by a polymerase of the RNA virus into a 
plus strand RNA molecule; and 

(b) quantifying a level of said reporter molecule to thereby 
determine the presence of the virus in the cell. 

24. The method of claim 23, wherein said reporter molecule is a 
polypeptide translated from said plus strand RNA molecule. 
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25. The method of claim 23, further comprising the step of 
comparing said level of said reporter molecule to that obtained from cells 
free of the virus, 

26. The method of claim 23, wherein at least a portion of said 
first polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 1-374 ofSEQ ID NO:33. 

27. The method of claim 23, wherein at least a portion of said 
second polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 9158-9609 of SEQIDNO:33. 

28. A method of screening for anti-viral drugs, the method 
comprising the steps of: 

(a) co-incubating a nucleic acid construct, a polynucleotide 
encoding at least a polymerase of an RNA virus and a 
potential anti-viral molecule under conditions suitable for 
transcription and translation of said nucleic acid construct and 
said polynucleotide encoding at least said polymerase, said 
nucleic acid construct including: 
(i) an expression cassette having: 

(one) a first polynucleotide region including a 5' NCR 
sequence of an RNA virus and at least an 
N-terminal portion of a coding sequence of said 
RNA virus; 

(two) a second polynucleotide region including a 3* 
UTR sequence of said RNA virus and at least a 
C-terminal portion of a coding sequence of said 
virus; and 
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(three)a third polynucleotide region encoding a 
reporter molecule, said third polynucleotide 
region being flanked by said first and said 
second polynucleotide regions; and 
(ii) a promoter sequence being operatively linked to said 
expression cassette in a manner so as to direct the 
transcription of a minus strand RNA molecule from 
said expression cassette when said nucleic acid 
construct is incubated with said polynucleotide 
encoding at least said polymerase of said RNA virus 
under said conditions suitable for transcription and 
translation, said first and said second polynucleotide 
regions being selected such that said minus strand 
RNA molecule transcribed is replicatable by said 
polymerase of said RNA virus into a plus strand RNA 
molecule; and 

(b) quantifying a level of said reporter molecule to thereby 
determine the anti-viral activity of said potential anti-viral 
molecule. 

29. The method of claim 28, wherein said reporter molecule is a 
polypeptide translated from said plus strand RNA molecule. 

30. The method of claim 28, further comprising the step of 
comparing said level of said reporter molecule to that obtained from cells 
free of the virus. 

31. The method of claim 28, wherein said potential anti-viral 
molecule is selected from the group consisting of a nucleoside or a 
nucleotide analogue and an immune-modulatory molecule. 
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32. The method of claim 28, wherein step (a) is effected by 
introducing said nucleic acid construct, said polynucleotide encoding at 
least said polymerase of said RNA virus and said potential anti-viral 
molecule into a cell. 

33. The method of claim 28, wherein step (a) is effected by 
introducing said nucleic acid construct and said potential anti-viral 
molecule into a cell infected with said RNA virus. 

34. The method of claim 28, wherein at least a portion of said 
first polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 1-374 of SEQ ID NO:33. 

35. The method of claim 28, wherein at least a portion of said 
second polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 9158-9609 of SEQIDNO:33. 

36. A method of determining drug resistance of an RNA virus, 
the method comprising the steps of: 

(a) co-incubating a nucleic acid construct, a polynucleotide 
encoding at least a polymerase of the RNA virus and an 
anti-viral drug molecule under conditions suitable for 
transcription and translation of said nucleic acid construct and 
said polynucleotide encoding at least said polymerase, said 
nucleic acid construct including: 
(i) an expression cassette having: 

(one) a first polynucleotide region including a 5' NCR 
sequence of an RNA virus and at least an 
N-terminal portion of a coding sequence of said 
RNA virus; 
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(two) a second polynucleotide region including a 
3* UTR sequence of said RNA virus and at least 
a C-terminal portion of a coding sequence of 
said virus; and 
(three)a third polynucleotide region encoding a 
reporter molecule, said third polynucleotide 
region being flanked by said first and said 
second polynucleotide regions; and 
(ii) a promoter sequence being operatively linked to said 
expression cassette in a manner so as to direct the 
transcription of a minus strand RNA molecule from 
said expression cassette when said nucleic acid 
construct is incubated with said polynucleotide 
encoding at least said polymerase of the RNA virus 
under said conditions suitable for transcription and 
translation, said first and said second polynucleotide 
regions being selected such that said minus strand 
RNA molecule transcribed is replicatable by said 
polymerase of the RNA virus into a plus strand RNA 
molecule; and 

(b) quantifying a level of said reporter molecule to thereby 
determine the resistance of the RNA virus to said anti-viral 
drug. 

37. The method of claim 36, fiirther comprising the step of 
comparing said level of said reporter molecule to that obtained from cells 
free of said anti-viral drug. 



38. The method of claim 36, wherein said reporter molecule is a 
polypeptide translated from said plus strand RNA molecule. 
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39. The method of claim 36, wherein said anti-viral drug is 
selected from the group consisting of a nucleoside or nucleotide analog and 
an immunomodulatory toolecule. 

40. The method of claim 36, wherein step (a) is effected by 
introducing said nucleic acid construct, said polynucleotide encoding at 
least said polymerase of said RNA virus and said anti-viral drug into a cell. 

41. The method of claim 36, wherein step (a) is effected by 
introducing said nucleic acid construct and said anti-viral drug into a cell 
infected with the RNA virus. 

42. The method of claim 36, wherein at least a portion of said 
first polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 1-374 of SEQ ID NO:33. 

43. The method of claim 36, wherein at least a portion of said 
second polynucleotide region is at least 50 % identical to a sequence 
encompassed by nucleotides 9158-9609 of SEQIDNO:33. 
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SEQUENCE LISTING 

<110> Siew Pheng Lim et al. 

<120> NUCLEIC ACID CONSTRUCTS AND METHODS OF UTILIZING SAME FOR DETECTING VIRAL 
INFECTION, FOR UNCOVERING ANTI-VIRAL DRUG CANDIDATES AND FOR DETERMINING 
DRUG RESISTANCE OF VIRAL ISOLATES 

<I30> 01/22137 

<160> 32 

<170> Patentln Ver. 2.X 

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 1 . 

ACTGTCTTCA CGCAGAAAGC GTCTAGCCAT 30 

<210> 2 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 2 

CACGCAGAAA GCGTCTAGCC AT 22 

<210> 3 
<211> 29 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 3 

CGAGACCTCCC GGGGCACTCG CAAGCACCC 29 

<210> 4 
<211> xx 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Description of Artificial sequence; PCR primer 
<400> 4 

TCCCGGGGCACTCGCAAGCACCCTATCAGG 



<210> 5 
<211> 30 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 5 

CTATCAGGCA GTACCACAAG GCCTTTCGCG 30 



<210> 6 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 6 

CCCGCYAGGA CYCCCCAGTGG 20 

<210> 7 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 7 

GCCCAGTTCC CCACCATGGA 20 

<210> 8 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 



<400> 8 

AGGGCAGTCC TGTTGATGTG C 21 
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<210> 9 
C211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 9 

AGGCTATCAT TGCAGTTCAG GGC 23 

<210> 10 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 10 

CCACTGGGGR GTCCTRGCGG G 21 

<210> 11 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 11 

TCCATGGTGG GGAACTGGGC 20 



<210> 12 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 12 

CGCCTCCGCA CGATGCAGCC AT 22 



<210> 13 
<211> 23 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 13 

AGTACCAGAC CTATGAAAAC CGC 23 

<210> 14 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 14 

ATGGACCGGG AGATGGCTGC A 21 



<210> 15 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 15 

AGGCTTTAGC CGTGTGAGAC A 21 

<210> 16 
<211> 21 
<212> DMA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 16 

GCGCCYATCA CGGCCTACTC C 21 

<210> 17 
<211> 21 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 17 

GACGACCTCC AGGTCAGCCG A 21 

<210> 18 
<211> 21 
<212> DKA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 

<400> 18 t 
ACGCCCACTT CTTGTCTCAG A 21 



<210> 19 
<211> XX 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 19 

ACTAAGCAGG CAGGAGACAA C 21 

<210> 20 
<211> 21 
<212> DMA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 20 

TTGATGGGTA ATTTGCTCTC C 21 

<210> 21 
<211> xx 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
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<400> 21 

GTGGTGACGC AGCAAGGAGT T 21 

<210> 22 
<211> xx 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 22 

CAGCGACGGG TCTTGGTCTA C 21 

<210> 23 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 23 

TCACCGGTTG GGGAGCAGAT AG 22 

<210> 24 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 24 

TCTACGGGGC CTACTACTCC ATT 23 

<210> 25 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 25 

CTACTACTCC ATTGAGCCAC TTGftC 25 



WO 02/08447 



PCT/IL01/00669 



7 

<210> 26 
<211> 33 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 26 

ACATGATCTG CAGAGAGGCC AGTATCAGCA CTC 33 

<210> 27 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 27 

GTCAAGTGGC TCAATGGAGT AGTAGGC 27 

<21Q> 28 
<211> 85 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 28 

GCCAGCCCCC GATTGGGGGC GACACTCCAC CATAGATCAC TCCCCTGTGA GGAACTACTG 60 
TCTTCACGCA GAAAGCGTCT AGCCA 85 

<210> 29 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 29 

GACCACGCG TATCGATGT CGACTTTTTT TTTTTTTTTT 40 



<210> 30 
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<211> 22 
<212> DNA, 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 30 

GACCACGCGT ATCGATGTCG AC 22 

<210> 31 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 31 

ATGGACCAGG AGTTGGCTGC ATCGTGC 27 

<210> 32 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: PCR primer 
<400> 32 

CTAACGCGCA CGCACGAATG AGGCCTT 27 

<210> 33 
<211> 9609 
<212> DNA 

<213> Hepatitis C virus 
<400> 33 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgtag cctccgggac 120 
cccccctccc gggagagcca tggtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactac tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg attcctaaac 360 
ctcaaagaaa aaccaaacgt aacacctacc gccgcccaca ggacgtcaag ttcccgggcg 420 
gtggccagat cgttggtgga gtttacctgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgactag gaagacttcc gagcggtcgc aacctcgtgg aaggcgacag cctatcccca 540 
aggctcgccg gcccgagggc tgggcctggg cacaacccgg gtacccttgg cccctctatg 600 
gcaatgaggg cttggggtgg gcaggatggc tcctgtcacc ccgtggctcc cgccctagtt 660 
ggggccccat agacccccgg cgtaggtcgc gtaatttggg taaggtcatc gataccctca 720 
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catgcggctt cgccgacctc atgggataca ttccgctcgt cggcgccccc ttagggggcg 780 
ctgccagggc cctggcgcat ggcgtcaggg ttctggagga cggcgtgaac tatgcaacag 840 
ggaaccttcc cggttgctct ttctctatct tcctcttggc tttgctgtct tgtttgacca 900 
tcccagcttc cgcttatgaa gtgcgcaacg cgtccggggt gtaccatgtt acgaacgact 960 
gctccaactc aagcattgtg tatgaggcag cggacataat catgcacacc cccggatgcg 1020 
taccctgcgt tcgggagaac aactcttccc gctgctgggt agcgctcact cccacgctcg 1080 
cggccaggaa cctcagcgtc cccaccacga caatacgacg ccatgtcgat ttgctcgttg 1140 
gggcggctgc tttctgctcc gctatgtacg tgggggatct ttgcggatcc gttctactcg 1200 
tctcccagct gttcaccctc tcgcctcgcc agcatgagac agtacaagat tgcaattgct 1260 
cactttatcc cggccacgta acaggtcacc gcatggcttg ggatatgatg atgaattggt 1320 
cgcctacaac agccctagtg ttatcgcaga tactccggat cccacaaact atcgtggaca 1380 
tggtggcggg ggcccactgg ggagtcctgg cgggcattgc ctactattcc atggtgggga 1440 
actgggctaa ggttttggtt gtgatgctac tctttgctgg cgttgacggg cacacccaag 1500 
tgatgggggg gtcgcaagcc tctaccatca acacgctcac gggaattttc tcccccgggg 1560 
cgaaacagaa gattcagctc ataaacacca acggcagttg gcacatcaac aggactgccc 1620 
tgaactgcaa tgactccctc aacactgggt tccttgccgc gctgttctac acccacagct 1680 
tcaactcgtc cggatgccta gagcgtatgg ccagctgccg ccccattgac aagttcgatc 1740 
aggggtgggg tcccattact tatgatgaag gccccgattt agaccagagg ccctattgct 1800 
ggcactacgc accccggtcg tgcggtattg tgcccgcgtc gcaggtgtgt ggcccagtgt 1860 
attgtttcac cccaagccct gtcgtggtgg ggacgaccga tcgtaccggt gcccccacgt 1920 
atagatgggg ggagaatgag acggacgtgc tgatcctcaa caacacgcgg ccgccgcagg 1980 
gcaactggtt cggctgcaca tggatgaata gcactgggtt caccaagaca tgcgggggcc 2040 
ccccatgtaa catcgggggg gctggtaaca acaccttggt ctgccccacg gactgcttcc 2100 
ggaaacaccc cgaggctact tacaccaaat gcggttcggg accctggttg acacctaggt 2160 
gcatggttga ctacccatac aggccttggc actacccctg tactgtcaac tttaccattt 2220 
tcaaggttag gatgtatgtg gggggcgtag agcacaggct caacgctgca cgcaactgga 2280 
ctcgaggaga gcgttgtgac ttggaggaca gggatagatc agagcttagc ccgctgctgc 2340 
tgtctacgac agagtggcag atactgcctt gctccttcac cactctaccg gctctgtcca 2400 
ctggcttgat ccacctccat cagaacatcg tggacgtgca gtatctgtac ggtatagggt 2460 
cagtggttgt ctccctcgtg atcaaatggg agtatgtctt gctgtccttc ttcctcctgg 2520 
cggacgcgcg cgtctgtgcc tgtttgtgga tgatgctgct gatagcccag gctgaggccg 2580 
ccttagagaa cctggtggtc ctcaatgcgg cgtccgtggc cggagcgcat ggcatcctca 2640 
ccttccttgt gttcttctgt gctgcctggt acatcaaggg caggctggtc cctggggcgg 2700 
catatgcttt ttacggcgtg tggccgctgc tcctgctctt gctggcgctg ccaccacgag 2760 
cgtacgctat ggaccgggag atggctgcat cgtgcggagg cgcggttttc ataggtctgg 2820 
tactcctgac cctgtcacca cactataaag tgttccttgc taggctcata tggtggttgc 2880 
agtactttac caccagagcc gaggcaattt tgcacgtgtg ggtcccccct ctcaacgtcc 2940 
gagggggccg tgatgccgtc attctcctca cgtgcgcggt ccacccagac ctaatctttg 3000 
acatcaccaa actcttgctc gctgtgctcg gtccgctcat ggtgtttctg gccggcataa 3060 
caagggtgcc atacttcgtg cgcgcccaag gcctcattcg tgcgtgcgcg ttagcgcgga 3120 
aggtcgctgg gggtcattat atccaaatgg ctctcatgaa gctggccgcg ctgacaggca 3180 
cgtatcttta cgaccatctt actccactgc gggactgggc ccatgcgggc ctgcgagacc 3240 
ttgcggtagc agtcgagcct gtcgtctttt ctgacatgga gaccaagatc atcacctggg 3300 
gggcagacac cgcagcgtgt ggggacatca tcctgggtct gcctgtctcc gctcgaaggg 3360 
ggagggagat acttttggga ccagccgata gtcttgaagg gcaggggtgg cgactccttg 3420 
cgcctattac ggcctactcc cagcagacgc ggggcctact tggctgcatc atcactagtc 3480 
tcacaggtcg ggacaagaac caggtcgagg gggaggttca agtggtttcc accgcaacac 3540 
agtctttcct agcgacctgc gtcaacggcg tgtgttggac tgtcttccat ggtgccggct 3600 
caaaaacctt agccggtcca aaggggccaa ttactcaaat gtacaccaac gtggaccagg 3660 
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acctcgtcgg ctggcaggcg ccccccgggg cgcgttcctt gacaccatgc acctgcggca 3720 
gctcggacct ttacctggtc acgaggcacg ctgacgteat tccggtgcgc cggcggggcg 3780 
acaacagagg gagcctactc tcccccaggc ccgtctccta cttaaagggc tcttcgggtg 3840 
gtccactcct ctgcccctcg gggcatgccg taggcatctt ccgggccgcc gtatgcaccc 3900 
ggggggttgc gaaggcggtg gactttatac ccgttgagtc tatggagact accatgcggt 3960 
ccccggtctt cacagacaat tcatcccccc cggccgtacc gcagacattc caagtggctc 4020 
atctacacgc tcccactggc agcggtaaga gcactaaggt gccggctgcg tatgcagccc 4080 
aagggtacaa ggtacttgtc ctgaacccgt ccgtggccgc taccttaggg ttcggaacgt 4140 
atatgtctaa ggcatatggt gtcgatccta acatcagaac cggggtaagg accatcacca 4200 
cgggcgcccc catcacgtac tccacttacg gcaagtttct tgccgacggt ggttgctccg 4260 
ggggagccta cgacatcata atgtgtgatg aatgccactc aactgactca actactgtct 4320 
tgggcatcgg cacagtcctg gaccaagcgg agacggctgg agcgcggctt gtcgtgctcg 4380 
ccaccgctac gcctccggga tcggtcaccg tgccacatcc caatattgag gagatcgctc 4440 
tatccaacac tggagagatc cccttctatg gcaaagccat ccccattgag accatcaagg 4500 
gggggaggca tctcattttc tgccattcca agaagaaatg tgacgagctc gctgcaaagt 4560 
tgtcgggcct cggacttaac gctgtagcgt attaccgggg ccttgatgtg tccgttatac 4620 
caactagcgg agacgtcgtt gtcgtggcaa cagacgctct aatgacgggc tttaccggcg 4680 
attccgactc agtgatcgac tgtaatacat gtgtcaccca gacagttgat ttcagcttgg 4740 
accctacctt caccattgag acgacgaccg tgccccaaga cgcggtgtcg cgctcgcagc 4800 
ggcgaggcag gactggtagg ggtagggggg gcatctacag gtttgtgact ccaggagaac 4860 
ggccctcggg catgttcgat tcttcggtcc tgtgtgagtg ctatgacgcg ggctgtgctt 4920 
ggtacgagct cacgcccgcc gagacctcag ttaggttgcg ggcttacctg aatacaccag 4980 
ggttacccgt ctgccaggac catctggagt tctgggagag tgtcttcaca ggcctcaccc 5040 
acatagacgc ccacttcttg tctcagacta agcaggcagg agacaacttc ccctacctgg 5100 
tagcgtacca ggctacagtg tgcgccagag cgcaggctcc acctccatca tgggatcaaa 5160 
tgtggaagtc tctcatacgg ctaaagccta cactgcacgg gccaacaccc ctgctatata 5220 
ggctagggac cgtccaaagc gagattaccc tcacacaccc agtgacaaaa tacatcatgg 5280 
catgcatgtc ggctgacctg gaggtcgtca cgagcacctg ggtgctagtg agcggagtcc 5340 
tcgcagccct ggccgcgtac tgcttgacaa caggcagcgt ggtcattgtg ggcaggatca 5400 
tcttgtccgg gaagcctgct gttattcccg acagggaagt tctttaccgg gagttcgatg 5460 
agatggagga gtgcgcctca cacctccctt acatcgaaca ggggatgcag ctcgctgagc 5520 
agttcaaaca aaaggcgctt gggctgctgc aaacggccac caagcaagcg gaggctgctg 5580 
ctcccgtggt ggagtccaag tggcgggccc ttgaggcctt ttgggcgaag cacatgtgga 5640 
acttcatcag cggaatacag tacctggcag gcttgtccac tctgcatggc aatcccgcaa 5700 
tagcatcact gatggcattt acagcctcta tcaccagccc gctcaccacc caacacaccc 5760 
tcctgtttaa catcttgggg ggatgggtgg ccgcccaact tgctccccct agtgctgctt 5820 
cggctttcgt aggtgccggc atcgctggtg cagccgtcgg cagcataggt ctcgggaagg 5880 
tgctcgttga cgtcctggcg ggttatggag caggcgtggc aggcgcactt gtggccttta 5940 
aggttatgag tggcgaggtg ccctccaccg aggacctggt caatttgctc cctgccgtcc 6000 
tctctcctgg tgccctggtc gtcggggtcg tgtgcgcagc aatactgcgt cggcacgtgg 6060 
gcccaggaga gggggctgtg cagtggatga accggctgat agcgttcgct tcgcggggta 6120 
accatgtttc ccccacgcac tatgtgcctg aaagcgacgc cgcagcacgt gtcactcaaa 6180 
tcctctccag cctcaccgtc acccaactgt tgaggaggct ccaccaatgg attaatgagg 6240 
attgctccac gccgtgctcc ggttcctggc tgagagatgt ttgggactgg atatgtacgg 6300 
tgctgactga cttcaagacc tggctccaat ccaaactcct gccgcggtta ccaggagtcc 6360 
cctttttctc gtgtcaacgc gggtacaggg gagtctggcg gggagacggc atcatgcaga 6420 
ccacctgccc atgtggagca caaatctctg gacatgtcaa aaatggttcc atgaggatcg 6480 
ttgggcctag atcctgcagc aacacgtggc atggaacatt ccccattaat gcgtacacca 6540 
cgggcccctg cacgcccgcc ccggcgccaa actactccag ggcactgtgg cgggtggctg 6600 
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ctgaggagta cgtggaggtc acgcgggtgg gggatttcca ctacgtgaca ggcatgacca 6660 
ctgacaacgt aaagtgccca tgtcaggttc cagcccccga attcttcaca gaggtggatg 6720 
gagtgcggct gcacaggtac gctccggcgt gcaaacccct cctacgggag gaggtcacat 6780 
tccaggtcgg gctcaaccag tacttggttg ggtcacagct cccgtgtgag cccgagccgg 6840 
acgtagcagt gctcacctcc atgcttaccg atccttctca catcacagca gagacggcta 6900 
agcgtaggct ggataggggg tctcccccct ccttggccag ttcttcagcc agccagttat 6960 
ctgcgccttc tttgaaggcg acttgcacta cccgtcacga ctccccagac gccggcctca 7020 
ttgaggccaa tctcctgtgg cggcaggaga tgggcgggaa catcactcgc gtggagtcag 7080 
agaacaaggt agtgattctg gactctttcg agccgcttcg agcggaagag gatgagaggg 7140 
aagtatccgt tccggcggaa atcctgcgaa aatccaggaa attccccaga gcgatgccca 7200 
tatgggcacg cccggattac aatcctccac tgctagagtc ctggaagaac ccggactacg 7260 
tccctccggt ggtacacggg tgcccattgc cgcccatcaa gggcccccca ataccacccc 7320 
cacggagaaa gaggacggtt gtcctaacag aatccaccgt gtcttctgcc ttggcggagc 7380 
tcgccacaaa gaccttcggc agctccggat catcggctgt cgacagcggt acagcgagcg 7440 
ccccccctga ccagccctct gacaacggcg acgcgggatc cgacgctgag tcgtactcct 7500 
ccatgccccc ccttgaaggg gagccggggg accccgatct cagcgacggg tcttggtcta 7560 
ccgtgagcga ggaggccagt gaggacgtcg tctgctgctc gatgtcctac acgtggacag 7620 
gcgctctgat cacaccatgc gctgcggagg agagcaaatt acccatcaat gcgctgagca 7680 
actccttgct gcgtcaccac aacatggtct atgccacaac atcccgcagt gccagtcagc 7740 
gacagaagaa ggttaccttt gacagactgc aagtcctgga cgaccactac cgggacgtgc 7800 
ttaaggagat gaaggcgaag gcgtccacag tcaaggccaa gcttctatcc gtagaagaag 7860 
cctgtaagct gacgccccca cactcggcca aatccaaatt tggctacggg gcaaaggacg 7920 
tccggaacct gtccagcaag gccgttaacc acatccgctc cgtgtggaag gacttgctgg 7980 
aagacactga gacaccaata gacaccacca tcatggcaaa gaacgaggtc ttctgcgtcc 8040 
agccagagaa gggaggccgc aagccagctc gccttatcgt gttcccggat ttgggggtcc 8100 
gtgtgtgcga gaaaatggcc ctttacgacg tggtctccac cctccctcag gccgtgatgg 8160 
gctcctcata cggattccag tactctcctg gacagcgggt cgagttcctg gtgaatgcct 8220 
ggaaatcaaa gaaaagcccc atgggcttcg catatgacac ccgctgtttt gactcaacgg 8280 
tcactgagag tgacatccgt gttgaggagt caatttacca atgttgtgac ttagtccccg 8340 
aagctagaca ggccataagg tcgctcacag agcgacttta tgtcgggggt cctctgacta 8400 
attcaaaagg gcagaactgc ggttatcgcc ggtgccgcgc gagcggcgtg ctgacgacta 8460 
gctgcggtaa cacccttaca tgttacttga aggcctctgc agcctgtcga gctgcaaagc 8520 
tccaggactg cacgatgctc gtgtgcggag acgaccttgt cgttatctgt gaaagcgcgg 8580 
gaacccaaga ggacgcggcg agcctacgag tctttacgga ggctatgact aggtattccg 8640 
ccccccccgg ggacccgccc caaccagaat acgacttgga gctgataaca tcgtgctcct 8700 
ccaatgtgtc ggtcgctcac gatgcatctg gcaaaagagt gtactacctc acccgtgacc 8760 
ccaccacccc ccttgcgcgg gctgcgtggg agacagctag acacactcca gtcaactcct 8820 
ggctaggcaa catcattatg tacgcaccca ctttgtgggc aaggatgatt ctgatgactc 8880 
acttcttctc catcctccta gcccaggagc aacttgaaaa agccctagat tgtcaaatct 8940 
acggggccta ctactccatt gagccacttg acctacctca gatcattgaa cgactccacg 9000 
gtcttagcgc attttcgctc catagttact ctccaggcga gatcaatagg gtggcttcat 9060 
gcctcagaaa acttggggta ccacccttgc gagcctggag acatcgggcc aggagtgtcc 9120 
gcgctaagct gctgtctcag gggggaaggg ccgccacttg tggcaaatac ctcttcaact 9180 
gggcagtaaa gaccaagctc aaactcactc caattccggc tgcgtcccag ttggacttgt 9240 
ccggctggtt cgttgctggc tacggcgggg gagacatata tcacagcctg tctcgcgccc 9300 
gaccccgctg gttcatgttg tgcctacccc tactctctgt aggggtaggc atcaacctgc 9360 
tccccaaccg gtgaatgggg agctaaacac tccaggccaa taggccaacc tgtttttttt 9420 
tttttttttt ttttttttcc tttttttttt tttttttttt tttttttttt ttttcctttt 9480 
tttctttttt tttttttttt tccttccttt tggtggctcc atcttagccc tagtcacggc 9540 
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tagctgtgaa aggtccgtga gccgcatgac tgcagagagt gctgatactg gcctctctgc 9600 
agatcatgt 9609 



<210> 34 
<211> 3010 
<212> PRT 

<213> Hepatitis C virus 



<400> 34 

Met Ser Thr lie Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Tyr 
1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 



Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

lie Pro Lys Ala Arg Arg Pro Glu Gly Trp Ala Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro lie Asp Pro 
100 105 HO 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr 
180 185 190 



Glu Val Arg Asn Ala Ser Gly Val Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 
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Asn Ser Ser lie Val Tyr Glu Ala Ala Asp lie He Met His Thr Pro 
210 215 220 



Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Leu Ser Val Pro Thr Thr 
245 250 255 

Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Leu Leu Val Ser 
275 280 285 



Gin Leu Phe Thr Leu Ser Pro Arg Gin His Glu Thr Val Gin Asp Cys 
290 295 300 

A3n Cys Ser Leu Tyr Pro Gly His Val Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Leu Ser Gin 
325 330 335 

He Leu Arg He Pro Gin Thr He Val Asp Met Val Ala Gly Ala His 
340 345 350 



Trp Gly Val Leu Ala Gly He Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
370 375 380 

Thr Gin Val Met Gly Gly Ser Gin Ala Ser Thr He Asn Thr Leu Thr 
385 390 395 400 

Gly He Phe Ser Pro Gly Ala Lys Gin Lys He Gin Leu He Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 



Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr His Ser Phe Asn 
435 440 445 



Ser Ser Gly Cys Leu Glu Arg Met Ala Ser Cys Arg Pro He Asp Lys 
450 455 460 
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Phe Asp Gin Gly Trp Gly Pro lie Thr Tyr Asp Glu Gly Pro Asp Leu 
465 470 475 480 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Ser Cys Gly lie 
485 490 495 

Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
500 505 510 

Pro Val Val Val Gly Thr Thr Asp Arg Thr Gly Ala Pro Thr Tyr Arg 
515 520 525 

Trp Gly Glu Asn Glu Thr Asp Val Leu He Leu Asn Asn Thr Arg Pro 
530 535 540 

Pro Gin Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe 
545 550 555 560 

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn 
565 570 575 

Asn Thr Leu Val Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
580 585 590 

Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
595 600 605 

Val Asp Tyr Pro Tyr Arg Pro Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu 
625 630 635 640 

Asn Ala Ala Arg Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
660 665 670 

Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 



Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly 
690 695 700 



He Gly Ser Val Val Val Ser Leu Val He Lys- Trp Glu Tyr Val Leu 
705 710 715 720 
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Leu Ser Phe Phe Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp 
725 730 735 

Met Met Leu Leu lie Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 
740 745 750 

) 

Val Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly He Leu Thr Phe 
755 760 765 

Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro 
770 , • 775 780 

Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala 
805 810 815 

Ser Cys Gly Gly Ala Val Phe He Gly Leu Val Leu Leu Thr Leu Ser 
820 825 830 

Pro His Tyr Lys Val Phe Leu Ala Arg Leu He Trp Trp Leu Gin Tyr 
835 840 845 

Phe Thr Thr Arg Ala Glu Ala He Leu His Val Trp Val Pro Pro Leu 
850 855 860 

Asn Val Arg Gly Gly Arg Asp Ala Val He Leu Leu Thr Cys Ala Val 
865 870 875 880 

His Pro Asp Leu He Phe Asp He Thr Lys Leu Leu Leu Ala Val Leu 
885 890 895 



Gly Pro Leu Met Val Phe Leu Ala Gly He Thr Arg Val Pro Tyr Phe 
900 905 910 



Val Arg Ala Gin Gly Leu He Arg Ala Cys Ala Leu Ala Arg Lys Val 
915 920 925 

Ala- Gly Gly His Tyr He Gin Met Ala Leu Met Lys Leu Ala Ala Leu 
930 935 940 

Thr Gly Thr Tyr Leu Tyr Asp His Leu Thr Pro Leu Arg Asp Trp Ala 
945 950 955 960 



His Ala Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe J 
965 970 975 
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Ser Asp Met Glu Thr Lys lie lie Thr Trp Gly Ala Asp Thr Ala Ala 
980 985 990 

Cys Gly Asp lie He Leu Gly Leu Pro Val Ser Ala Arg Arg Gly Arg 
995 1000 1005 

Glu He Leu Leu Gly Pro Ala Asp Ser Leu Glu Gly Gin Gly Trp Arg 
1010 1015 1020 

Leu Leu Ala Pro He Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu 
1025 1030 1035 1040 

Gly Cys He He Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu 
1045 1050 1055 

Gly Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr 
1060 1065 1070 

Cys Val Asn Gly Val Cys Trp Thr Val Phe His Gly Ala Gly Ser Lys 
1075 1080 1085 

Thr Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met Tyr Thr Asn Val 
1090 1095 1100 

Asp Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu 
1105 1110 1115 H20 

Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His 
1125 1130 H35 

Ala Asp Val He Pro Val Arg Arg Arg Gly Asp Asn Arg Gly Ser Leu 
1140 1145 1150 

Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro 
1155 1160 1165 

Leu Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val 
1170 1175 1180 

Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe He Pro Val Glu Ser 
1185 H90 1195 1200 

Met Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro 
1205 1210 1215 



Pro Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr 
1220 1225 1230 
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Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly 
1235 1240 1245 

Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe 
1250 1255 1260 

Gly Thr Tyr Met Ser Lys Ala Tyr Gly Val Asp Pro Asn He Arg Thr 
1265 1270 1275 1280 

Gly Val Arg Thr lie Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr 
1285 1290 1295 

Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He 
1300 1305 1310 

lie Met Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr Val Leu Gly 
1315 1320 1325 

He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 
1330 1335 1340 

Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro Hi3 Pro 
1345 1350 1355 1360 

Asn He Glu Glu He Ala Leu Ser Asn Thr Gly Glu lie Pro Phe Tyr 
1365 1370 1375 

Gly Lys Ala He Pro He Glu Thr He Lys Gly Gly Arg His Leu He 
1380 1385 1390 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser 
1395 1400 1405 

Gly Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser 
1410 1415 1420 

Val He Pro Thr Ser Gly Asp Val Val Val Val Ala Thr Asp Ala Leu 
1425 1430 1435 1440 

Met Thr Gly Phe Thr Gly Asp Ser Asp Ser Val He Asp Cys Asn Thr 
1445 1450 1455 

Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He 
1460 1465 1470 



Glu Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg 
1475 1480 1485 
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Gly Arg Thr Gly Arg Gly Arg Gly Gly He Tyr Arg Phe Val Thr Pro 
1490 1495 1500 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys 
1505 1510 1515 1520 

Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser 
1525 1530 1535 

Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin 
1540 1545 1550 

Asp His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He 
1555 1560 1565 

Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro 
1570 1575 1580 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro 
1585 1590 1595 1600 

Pro Pro Ser Trp Asp Gin Met Trp Lys Ser Leu He Arg Leu Lys Pro 
1605 1610 1615 

Thr Leu His Gly Pro Thr Pro Leu lieu Tyr Arg Leu Gly Thr Val Gin 
. 1620 1625 1630 

Ser Glu XXe Thr Leu Thr His Pro Val Thr Lys Tyr He Met Ala Cys 
1635 1640 1645 

Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Ser 
1650 1655 1660 

Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val 
1665 1 670 1675 1 680 

Val He Val Gly Arg He He Leu Ser Gly Lys Pro Ala Val He Pro 
1685 1690 1695 

Asp Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala 
1700 1705 1710 

Ser His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe 
1715 1720 1725 



Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Thr Ly3 Gin Ala Glu 
1730 1735 1740 
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Ala Ala Ala Pro Val Val Glu Ser Lys Trp Arg Ala Leu Glu Ala Phe 
1745 1750 1755 1760 

Trp Ala Lys His Met Trp Asn Phe lie Ser Gly lie Gin Tyr Leu Ala 
1765 1770 1775 

Gly Leu Ser Thr Leu His Gly Asn Pro Ala lie Ala Ser Leu Met Ala 
1780 1785 1790 

Phe Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu 
1795 1800 1805 

Phe Asn lie Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser 
1810 1815 1820 

Ala Ala Ser Ala Phe Val Gly Ala Gly lie Ala Gly Ala Ala Val Gly 
1825 1830 1835 1840 

Ser He Gly Leu Gly Lys Val Leu Val Asp Val Leu Ala Gly Tyr Gly 
1845 1850 *' 1855 

Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu 
I860 1865 1870 

Val Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala Val Leu Ser 
1875 1880 1885 

Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg 
1890 1895 1900 

His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He 
1905 1910 1915 1920 

Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro 
1925 1930 1935 

Glu Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr 
1940 1945 1950 

Val Thr Gin Leu Leu Arg Arg Leu His Gin Trp He Asn Glu Asp Cys 
1955 I960 1965 

Ser Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He 
1970 1975 1980 



Cys Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu 
1985 1990 1995 2000 
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Pro Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Arg 
2005 2010 2015 

Gly Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly 
2020 2025 2030 

Ala Gin He Ser Gly His Val Lys Asn Gly Ser Met Arg He Val Gly 
2035 2040 2045 

Pro Arg Ser Cys Ser Asn Thr Trp His Gly Thr Phe Pro lie Asn Ala 
2050 2055 2060 

Tyr Thr Thr Gly Pro Cys Thr Pro Ala Pro Ala Pro Asn Tyr Ser Arg 
2065 2070 2075 2080 

Ala Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val 
2085 2090 2095 

Gly Asp Phe His Tyr Val Thr Gly Met Thr Thr A3p Asn Val Lys Cys 
2100 2105 2110 

Pro Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val 
2115 2120 2125 

Arg Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu 
2130 2135 2140 

Val Thr Phe Gin Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu 
2145 2150 2155 2160 

Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr 
2165 2170 2175 

Asp Pro Ser His He Thr Ala Glu Thr Ala Lys Arg Arg Leu Asp Arg 
2180 2185 2190 

Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala 
2195 2200 2205 

Pro Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala 
2210 2215 2220 

Gly Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn 
2225 2230 2235 2240 



He Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe 
2245 2250 2255 
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Glu Pro Leu Arg Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala 
2260 2265 2270 



Glu lie Leu Arg Lys Ser Arg Lys Phe Pro Arg Ala Met Pro lie Trp 
2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asn Pro 
2290 2295 2300 

Asp Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro lie Lys 
2305 2310 2315 2320 

Gly Pro Pro lie Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Thr 
2325 2330 2335 



Glu Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe 
2340 2345 2350 

Gly Ser Ser Gly Ser Ser Ala Val Asp Ser Gly Thr Ala Ser Ala Pro 
2355 2360 2365 

Pro Asp Gin Pro Ser Asp Asn Gly Asp Ala Gly Ser Asp Ala Glu Ser 
2370 2375 2380 

Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
2385 2390 2395 2400 

Ser Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val 
2405 2410 2415 



Val Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro 
2420 2425 2430 

Cys Ala Ala Glu Glu Ser Lys Leu Pro lie Asn Ala Leu Ser Asn Ser 
2435 2440 2445 

Leu Leu Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala 
2450 2455 2460 



Ser Gin Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp 
2465 2470 2475 2480 

Asp His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr 
2485 2490 2495 



Val Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro 
2500 2505 2510 
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Pro His Ser Ala Lys Ser I»ys Phe Gly Tyr Gly Ala Lys Asp Val Arg 
2515 2520 2525 

Asn Leu Ser Ser Lys Ala Val Asn His He Arg Ser Val Trp Lys Asp 
2530 2535 2540 

Leu Leu Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys 
2545 2550 2555 2560 

Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala 
2565 2570 2575 

Arg Leu He Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met 
2580 2585 2590 

Ala Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser 
2595 2600 2605 

Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu val 
2610 2615 2620 

Asn Ala Trp Lys Ser Lys Lys Ser Pro Met Gly Phe Ala Tyr Asp Thr 
2625 2630 2635 2640 

Arg Cys Phe Asp Ser Thr Val Thr Glu Ser Asp He Arg Val Glu Glu 
2645 2650 2655 

Ser lie Tyr Gin Cy3 Cys Asp Leu Val Pro Glu Ala Arg Gin Ala He 
2660 2665 2670 

Arg Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro Leu Thr Asn Ser 
2675 2680 2685 

Lys Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu 
2690 2695 . 2700 

Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala 
2705 2710 2715 2720 

Ala Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly 
2725 2730 2 735 

Asp Asp Leu Val Val He Cy3 Glu Ser Ala Gly Thr Gin Glu Asp Ala 
2740 2745 2750 



Ala Ser Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro 
2755 2760 2765 
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Pro Gly Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu lie Thr Ser 
2770 2775 2780 

Cys Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val 
2785 2790 2795 2800 

Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp 
2805 2810 2815 

Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn lie lie 
2820 2825 2830 

Met Tyr Ala Pro Thr Leu Trp Ala Arg Met lie Leu Met Thr His Phe 
2835 2840 2845 

Phe Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys 
2850 2855 2860 

Gin He Tyr Gly Ala Tyr Tyr Ser He Glu Pro Leu Asp Leu Pro Gin 
2865 2870 2875 2880 

lie lie Glu Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr 
2885 2890 2895 

Ser Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly 
2900 2905 2910 

Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala Arg Ser Val Arg Ala 
2915 2920 2925 

Lys Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu 
2930 2935 2940 

Phe Asn Trp Ala Val Lys Thr Lys Leu Lys Leu Thr Pro He Pro Ala 
2945 2950 2955 2960 

Ala Ser Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Gly Gly 
2965 2970 2975 

Gly Asp lie Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met 
2980 2985 2990 

Leu Cys Leu Pro Leu Leu Ser Val Gly Val Gly lie Asn Leu Leu Pro 
2995 3000 3005 



Asn Arg 
3010 



